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A METHOD OF SCREENING FOR PROTEIN SECRETING RECOMBINANT / ftd&T CELLS 

Field of invention 

The invention describes a method of screening for protein secreting recombinant host 
5 cells. The method can be used for rapid identification of actively secreting transformants and 
can be used to screen recombinant libraries for transformants secreting proteins. 

Background of the invention 

Proteins which are secreted are highly interesting for use in industrial applications. A 
1 0 positive selection screening system which selects only host cells secreting proteins is thus very 
desirable. 

Signal trapping is a method to identify genes containing a signal peptide using a 
translational fusion to an extracellular reporter gene lacking its own signal. This has been 
reported in the literature for the purpose of identifying new signal sequences (Smith, H. et al., 

15 1987, Construction and use of signal sequence selection vectors in Escherichia coli and 
Bacillus subtilis. J. Bact. 169:3321-3328), also the use of such for defining clearly the specific 
elements within signal peptides which are required for optimal function (Smith, H. et al, 1988. 
Characterisation of signal-sequence-coding regions selected from the Bacillus subtilis 
chromosome. Gene. 70:351-361). 

20 A further development, signal sequence trapping, has been described in WO 01/77315 

(Novozymes A/S). 

HtrA-type serine proteases participate in folding and degradation of aberrant proteins 
and in processing and maturation of native proteins (Pallen MJ; Wren BW (1997): The HtrA 
family of serine proteases. Molecular microbiology 26: 209-221). It has been shown that the 

25 Bacillus subtilis YkdA and YvtA , members of this family are induced by secretion stress; when 
cells are expressing and secreting heterologous amylases (Noone D, Howell A, Collery R, and 
Kevin M. Devine (2001): YkdA and YvtA, HtrA-Like Serine Proteases in Bacillus subtilis. 
Engage in Negative Autoregulation and Reciprocal Cross-Regulation of ykdA and yvtA Gene 
Expression. Journal of Bacteriology 183: 654-663). This secretion stress induction happens at 

30 the transcriptional level. 

Summary of the invention 

The problem to be solved by the present invention is to identify those samples in a 
collection of host cells that efficiently secrete polypeptides, e.g. enzymes, even enzymes with 
35 unknown activity, without having to screen the collection by traditional labour- and 
timeconsuming techniques like plasmid or genome analysis to find host cells that contain the 
right gene insert, thereafter to culture the selected host cells in liquid media and perform SDS- 
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gel analysis on the host cell samples to identify the ones that are secreting recombinant 
protein. 

We describe the introduction of an inducible promoter operably linked to a reporter 
gene into a host cell, the host cell further comprising a nucleic acid sequence of interest. The 
5 said construct may conveniently be used to screen for recombinant host cells that are 
secreting protein by colony colour, measuring clearing zones in substrate agars or gels, or by 
monitoring product formation in culturing supernatant. The invention is applicable both in 
expression cloning and in library screening. 

10 Accordingly in a first aspect, the invention relates to a method of screening for protein 

secreting recombinant host cells comprising screening for promoter activity of a stress 
inducible promoter 

In a second aspect, the invention relates to a method of screening for protein secreting 
recombinant host cells comprising the steps of 
15 (i) Providing a host cell comprising the secretion stress inducible promoter operably 

linked to nucleic acid sequence encoding a reporter protein or a regulator protein. 

(ii) Providing a nucleic acid sequence of interest. 

(iii) Introducing the nucleic acid sequence in (ii) into the host cell in (i) 

(iv) Culturing host cell obtained in (iii) under conditions promoting expression of the 
20 protein encoded by the nucleic acid sequence from (ii); and 

(v) Selecting the host cell exhibiting the desired level of reporter protein expression. 

In a particular embodiment, the regulator protein controls the expression of the reporter 
gene by activation or inhibition of the expression of the reporter protein. 

The host cell of the present invention may be selected from bacterial cells. 
25 in a t hird a spect, t he I nvention r elates t o a m ethod w here t he i nducible p romoter i s 

comprised by or comprises the nucleic acids 1-999 of SEQ ID NO.:1. 

In a fourth aspect, the invention relates to a method where the inducible promoter is in 
its normal position the promoter linked to a gene encoding a polypeptide which has at least 
70%, preferably 80%, or 90% or 95% or 98% identity to the amino acid sequence of SEQ ID 
30 NO.:2. 

In a fifth aspect, the stress inducible promoter is comprised by or comprises the 
Repeated octameric motif of SEQ ID NO.: 3. 

In a sixth aspect, the invention relates to a method where the reporter protein is 2-fold, 
preferably 5-fold, or 10-fold, or 20-fold, or 50-fold or 100-fold over expressed in a secretion 
35 stressed cell compared to a non secretion stressed cell. 

In a seventh aspect, the invention relates to a method where the reporter protein is 
selected from the group consisting of fluorescent protein, antibiotic markers, and substrate 
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converting enzymes. 

In an eighth aspect, the invention relates to a method where the host cell further 
comprises an IPTG-inducible promoter operably linked to a nucleic acid sequence encoding 
the amino acids of SEQ ID NO:2. 

5 

Definitions 

Prior to a discussion of the detailed embodiments of the invention, a definition of 
specific terms related to the main aspects of the invention is provided. 

10 In accordance with the present invention, there may be employed conventional 

molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. 
Such techniques are explained fully in the literature. See. e.g.. Sambrook. Fritsch & Maniatis, 
Molecular Cloning: A Laboratory Manual. Second Edition (1989) Cold Spring Harbor 
Laboratory Press. Cold Spring Harbor. New York: DNA Cloning: A Practical Approach. 

15 Volumes I and II /D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed. 1984); 
Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds (1985)); Transcription And 
Translation (B.D. Hames & S.J. Higgins. eds. (1984)); Animal Cell Culture (R.I. Freshney, ed. 
(1986)); Immobilized Cells And Enzymes (IRL Press. (1986)); B. Perbal. A Practical Guide To 
Molecular Cloning (1984). 

20 

Expression cloning is the optimised cloning of a gene (containing an open reading 
frame) into an expression vector that will allow it to be expressed at a high level in a selected 
host. The plasmid will in most cases contain a strong promoter region that allows a strong 
transcription and optimal sequences for efficient translation of the gene of interest. 
25 Genes in a library will either be transcribed from their own promoter that might not be 

strong (genomic libraries), or from a promoter in the cloning vector that is typically not placed 
optimal for the gene to be highly expressed (genomic and cDNA libraries). 

The term parent protein (e.g. "parent enzyme") may be termed wild type protein (e.g. 
30 "wild type enzyme"). 

" A " polynucleotide" i s a single- o r d ouble-stranded p olymer of d eoxyribonucleotide o r 
ribonucleotide bases read from the 5' to the 3 1 end. Polynucleotides include RNA and DNA. 
and may be isolated from natural sources, synthesized in vitro, or prepared from a combination 
35 of natural and synthetic molecules. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of 
ribonucleosides (adenosine, guanosine. uridine or cytidine; "RNA molecules") or 
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deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or deoxycylidine; 
"DNA molecules") in either single stranded form, or a double-stranded helix. Double stranded 
DNA-DNA, DNA-RNAand RNA-RNA helices are possible. The term nucleic acid molecule, 
and in particular DNA or RNA molecule, refers only to the primary and secondary structure of 
5 the molecule, and does not limit it to any particular tertiary or quaternary forms. Thus, this term 
includes double-stranded DNA found, inter alia, in linear or circular DNA molecules (e.g., 
restriction fragments), plasmids, and chromosomes. In discussing the structure of particular 
double-stranded DNA molecules, sequences may be described herein according to the normal 
convention of giving only the sequence in the 5* to 3 1 direction along the non-transcribed strand 
10 of DNA (i.e., the strand having a sequence homologous to the mRNA). A "recombinant DNA 
molecule" is a DNA molecule that has undergone a molecular biological manipulation. 

A DNA "coding sequence" is a double-stranded DNA sequence, which is transcribed 
and translated into a polypeptide in a cell in vitro or in vivo when placed under the control of 

15 appropriate regulatory sequences. The boundaries of the coding sequence are determined by 
a start codon at the 5' (amino) terminus and a translation stop codon at the 3' (carboxyl) 
terminus. A coding sequence can include, but is not limited to, prokaryotic sequences, cDNA 
from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and 
even synthetic DNA sequences. If the coding sequence is intended for expression in a 

20 eukaryotic cell, a polyadenylation signal and transcription termination sequence will usually be 
located 3' to the coding sequence. 

A "gene" refers a nucleic acid sequence encoding a peptide, a polypeptide or a protein. 
In a particular embodiment the term "reporter gene" refers to a nucleic acid sequence encoding 
25 a reporter protein. 

An "Expression vector" is a DNA molecule, linear or circular, that comprises a segment 
encoding a polypeptide of interest operably linked to additional segments that provide for its 
transcription. Such additional segments may include promoter and terminator sequences, and 
30 optionally one or more origins of replication, one or more selectable markers, an enhancer, a 
polyadenylation signal, and the like. Expression vectors are generally derived from plasmid or 
Viral DNA, or may contain elements of both. 

Transcriptional and translational control sequences are DNA regulatory sequences, 
35 such as promoters, enhancers, terminators, and the like, that provide for the expression of a 
coding sequence in a host cell. In eukaryotic cells, polyadenylation signals are control 
sequences. 
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A "secretory signal sequence" is a DNA sequence that encodes a polypeptide (a 
"secretory peptide") that, as a component of a larger polypeptide, directs the larger polypeptide 
through a secretory pathway of a cell in which it is synthesized. The larger polypeptide is 
5 commonly cleaved to remove the secretory peptide during transit through the secretory 
pathway. 

The term "promoter" is used herein for its art-recognized meaning to denote a 
sequence flanking the gene containing DNA sequences that provide for the binding of RNA 
1 0 polymerase and initiation of transcription and furthermore it contains DNA sequences that are 
responsible for the regulation of the transcription of the gene. Promoter sequences are 
commonly, but not always, found In the 5' non-coding regions of genes. In a particular 
embodiment o f t he I nvention t he p romoter i s a n i nducible p romoter, e .g. a s ecretion stress 
induced promoter or a miss folding stress induced promoter. 



15 



"Operably linked", when referring to DNA segments, indicates that the segments are 
arranged so that they function in concert for their intended purposes, e.g. transcription initiates 
in the promoter and proceeds through the coding segment to the terminator. 

20 A coding sequence is "Under the control" of transcriptional and translational control 

sequences in a cell when RNA polymerase transcribes the coding sequence into mRNA, which 
is then trans-RNA spliced and translated into the protein encoded by the coding sequence. 

"Isolated polypeptide" is a polypeptide which is essentially free of other nonenzyme] 
25 polypeptides, e.g., at least about 20% pure, preferably at least about 40% pure, more 
preferably about 60% pure, even more preferably about 80% pure, most preferably about 90% 
pure, and even most preferably about 95% pure, as determined by SDS-PAGE. 

"Heterologous" DNA refers to DNA not naturally located in the cell, or in a chromosomal 
30 site of the cell. Preferably, the heterologous DNA includes a gene foreign to the cell. 

A cell has been "transfected" by exogenous or heterologous DNA when such DNA has 
been introduced inside the cell. A cell has been "transformed" by exogenous or heterologous 
DNA when the transfected DNA effects a phenotypic change. Preferably, the transforming 
35 DNA should be integrated (covalently linked) into chromosomal DNA making up the genome of 
the cell. 
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"Homologous recombination" refers to the insertion of a foreign DNA sequence of a 
vector in a chromosome. Preferably, the vector targets a specific chromosomal site for 
homologous recombination. For specific homologous recombination, the vector will contain 
sufficiently long regions of homology to sequences of the chromosome to allow complementary 
5 binding and incorporation of the vector into the chromosome. Longer regions of homology, and 
greater degrees of sequence similarity, may increase the efficiency of homologous 
recombination. 

A chaperone is a protein which assists another polypeptide in folding properly (Hartl et 
10 al.. 1994, TIBS 19:20-25; Bergeron et aL, 1994, TIBS 19:124-128; Demolder et aL. 1994, 
Journal of Biotechnology 32:179-189; Craig, 1993. Science 260:1902-1903; Gething and 
Sambrook, 1992, Nature 355:33-45; Puig and Gilbert, 1994, Journal of Biological Chemistry 
269:7764-7771; Wang and Tsou, 1993, The FASEB Journal 7:1515-11157; Robinson et al.. 
1994, Bio/Technology 1:381-384). The nucleic acid sequence encoding a chaperone may be 
15 obtained from the genes encoding Bacillus subtilis GroE proteins. For further examples, see 
Gething and Sambrook, 1992, supra, and Hartl et al., 1994, supra. 

A processing protease is a protease that cleaves a propeptide to generate a mature 
biochemically active polypeptide (Enderlin and Ogrydziak, 1994, Yeast 10:67-79; Fuller et al., 
20 1989, Proceedings of the National Academy of Sciences USA 86:1434-1438; Julius et al.. 
1984, Cell 37:1075-1089; Julius et al., 1983, Cell 32:839-852). The nucleic acid sequence 
encoding a processing protease may be obtained from the genes encoding Aspergillus niger 
Kex2, Saccharomyces cerevisiae dipeptidylaminopeptidase, Saccharomyces cerevisiae Kex2, 
and Yarrowia lipolytica dibasic processing endoprotease (xpr6). 



25 



The term "randomized library" of protein variants refers to a library with at least partially 
randomized composition of the members, e.g. protein variants. 

The term "functionality" of protein variants refers to e.g. enzymatic activity, binding to a 
30 ligand or receptor, stimulation of a cellular response (e.g. 3H-thymidine incorporation as 
response to a mitogenic factor), or anti-microbial activity. 

By the term "specific polyclonal antibodies" is meant polyclonal antibodies isolated 
according to their specificity for a certain antigen, e.g. the protein backbone. 



35 



"Spiked mutagenesis" is a form of site-directed mutagenesis, in which the primers used 
have been synthesized using mixtures of oligonucleotides at one or more positions. 
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Detailed description of the invention 

The present invention relates to a method of screening for protein secreting 
recombinant host cells comprising screening for promoter activity of a stress inducible 
5 promoter 

In a particular aspect the invention relates to a method of screening for protein 
secreting recombinant host cells comprising the steps of 

(i) Providing a host cell comprising the secretion stress inducible promoter operably 
linked to nucleic acid sequence encoding a reporter protein or a regulator protein. 
10 (ii) Providing a nucleic acid sequence of interest. 

(iii) Introducing the nucleic acid sequence in (ii) into the host cell in (i) 

(iv) Culturing host cell obtained in (iii) under conditions promoting expression of the 
protein encoded by the nucleic acid sequence from (ii); and 

(v) Selecting the host cell exhibiting the desired level of reporter protein expression. 
1 5 The host cell of the present invention may be selected from bacterial cells. 

Host cell 

The choice of a host cell will to a large extent depend upon the nucleic acid sequence 
of interest and its source. In the case where the host cell expresses an antimicrobial peptide, 
20 careful consideration should be given to the compatibility of the host cell and the expressed 
antimicrobial peptide. 

Useful unicellular cells are bacterial cells such as gram positive bacteria including, but 
not limited to, a Bacillus cell, e.g.. Bacillus alkalophilus. Bacillus agaradhaerens, Bacillus 
amyloliquefaciens. Bacillus brevis, Bacillus clausii, Bacillus circulans, Bacillus coagulans, 
25 Bacillus lautus, Bacillus lentus. Bacillus licheniformis, Bacillus megaterium. Bacillus 
stearothermophilus, Bacillus subtilis. Bacillus thuringiensis, or a Streptomyces cell, e.g., 
Streptomyces lividans or Streptomyces murinus. or gram negative bacteria such as E. co/i and 
Pseudomonas sp. Pseudomonas putida. In a preferred embodiment, the bacterial host cell is 
a Bacillus lentus. Bacillus licheniformis. Bacillus stearothermophilus, or Bacillus subtilis cell. In 
30 another preferred embodiment, the Bacillus cell is an alkalophilic Bacillus. Finally Lactococcus 
lactis is considered useful. 

It is to be understood that any number of host cells may be included in the screening 
assay. In expression cloning typically 10-1000 host cells are screened, whereas library 
screening typically includes in the range of 500-100.000 for a Bacillus library. The host cells 
35 may also secrete different proteins as different nucleic acid sequences may have been 
introduced as e.g. in library screening techniques. 

In another interesting embodiment, the host cell contains the inducible promoter, which 
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is comprised by or comprises nucleic acids 1-999 of SEQ ID NO.:1 linked to a reporter gene, 
and further an IPTG-inducible promoter operably linked to a nucleic acid sequence encoding 
the amino acids of SEQ ID NO:2 

5 The construction of the host cell (DN27) is described in Noone et at. 2000 (Noone D, 

Howell A, and Kevin M. Devine (2000) Expression of ykdA, Encoding a Bacillus subtilis 
Homologue of HtrA, Is Heat Shock Inducible and Negatively Autoregulated. Journal of 
Bacteriology 182: 1592-1599). The host contains the following features: the full ykdA promoter 
region (nucleic acids 1-999 of SEQ ID NO.: 1) is fused to the LacZ reporter gene. In addition 

10 an intact copy of the ykdA gene (nucleic acids 1000-2349 of SEQ ID NO.: 1) is placed under 
control of the IPTG-inducible Pspac promoter and the native ykdA gene is knocked out. In this 
way the ykdA gene itself is no longer secretion stress induced but instead ykdA expression is 
controllable by IPTG. The ykdA gene is negatively autoregulated. It is desirable to have a low 
level of the ykdA gene expressed, to avoid background expression of the reporter gene. 

15 

Inducible promoters. 

In the context of the present invention stress inducible promoter, inducible promoter 
and inducible promoter gene is used as synonymous. Non-limiting examples of bacillus 
inducible promoters are the ykdA promoter, yvtA promoter, and cssRS promoter. Two of these 

20 are members of the HtrA-like serine protease family encoded in the B. subtilis genome. YkdA 
(also called HtrA), YvtA (also called HtrB) (Hecker, M., and U. Volker. 1998. Non-specific, 
general and multiple stress resistance of growth-restricted Bacillus subtilis cells by the 
expression of the sigmaB regulon. Mol. Microbiol. 29:1129-1136). Promoter analysis suggests 
that H trA-like proteases e ncoded i n B . subtilis m ay h ave d istinctive b ut partially overlapping 

25 expression profiles and functions within the cell. Expression of ykdA and yvtA is induced both 
by heat shock and by secretion stress using a common mechanism. ykdA and yvtA expression 
is induced in response to heterologous protein secretion or so called "secretion stress". 
Secretion stress inducible promoters are characterised in that they are induced by a 
multifactorial stimulus consisting of 

30 (i) the secretion load (i.e. the total number of proteins and the amount of each 

protein being processed and/or secreted) 

(ii) the level of protein maturation, and 

(iii) the level of aberrant protein degradation. 

This multifactorial stimulus is called secretion stress, and the promoters are stress 
35 induced promoters or secretion stress induced promoters. 



This has been shown by Noone et al (2001) where cells expressing and secreting 
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recombinant amylases, showed a dramatic increase in expression of a ykda-lacZ construct in 
the transition phase of the growth cycle (50 fold more lacZ accumulation). A similar, but not as 
dramatic response was seen for a yvtA-lacZ construct. The recombinant amylase induction of 
both promoter-lacZ constructs occurred at the transcriptional level. Antelmann et al. 

5 (Antelmann H; Darmon E; Noone D; Veening J; Westers H; Bron S; Kuipers OP; Devine KM; 
Hecker M; van Dijl JM. (2003): The extracellular proteome of Bacillus subtilis under secretion 
stress conditions. Molecular Microbiology, 49: 143-156) showed by Northern blot analysis that 
the ykdA transcript was increased by a factor of 10-20 by heterologous amylase expression. 
Expression of ykdA is negatively autoregulated. This was demonstrated in cells containing the 

10 ykdA promoter linked to the beta-galactosidase reporter gene (Noone D, Howell A, and Kevin 
M. Devine (2000): Expression of ykdA, Encoding a Bacillus subtilis Homologue of HtrA, Is Heat 
Shock Inducible and Negatively Autoregulated. Journal of Bacteriology182: 1592-1599). The 
level of beta-galactosidase steadily increases in ykdA mutant cells throughout exponential 
growth, in contrast to ykdA+ cells, where expression levels are low and constant. Primer 

1 5 extension and Northern analysis show that the regulation occurs at the level of transcription. 

Members of the HtrA family of serine proteases are widely distributed among bacteria 
and have atso been found in yeast, plants, and humans. Information derived from completely 
sequenced genomes shows that most eubacteria have a single HtrA-like serine protease. 

20 However, a significant number of bacterial genomes encode more than one HtrA-like serine 
protease. Mycobacterium tuberculosis has four such genes; Escherichia coli, Bacillus subtilis, 
Treponema pallidum, Deinococcus radlodurans, and Synechocystis each have three copies, 
while Haemophilus influenzae and Pseudomonas aeruginosa each have two copies. In some 
archaebacteria a recognizable member of the HtrA-protease family has also been identified. 

25 The proteins belonging to the HtrA family are characterized by an amino-terminal 

domain that participates in protein localization, a catalytic domain containing an active serine 
residue, and a PDZ domain that functions in multimerisation of the protein into the active 
dodecamer structure and perhaps also in identification of target proteins. Recent work has 
shown that HtrA can function both as a molecular chaperone and as a protease (Spiess et al. 

30 1999). The switch between these activities is temperature dependent, with the chaperone 
activity predominating at lower temperatures and the protease activity predominating at high 
temperature. 

YkdA and yvtA are regulated by the two component system CssR-CssS of the 
membrane. CssR-CssS responds to heat and secretion stress by activating the expression of 
35 ykdA and yvtA (Darmon E; Noone D; Masson A; Bron S; Kuipers OP; Devine KM; van Dijl JM 
(2002): A novel class of heat and secretion stress-responsive genes is controlled by the 
autoregulated CssRS two-component system of Bacillus subtilis. Journal of Bacteriology 184: 

9 
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5661-5671). Misfolded proteins can accumulate in the cell through thermal denaturation or 
from a limited availability of appropriate folding catalysts at the extracytoplasmic side. The 
synthesis of proteases at elevated levels is one of a variety of cellular responses that 
counteract the detrimental effects of the presence of misfolded proteins. The latter mechanism 
5 would operate particularly on high-level production of secreted proteins. In this respect, it is 
important to bear in mind that most proteins of B. subtilis are transported across the membrane 
in an unfolded conformation via the Sec translocation channel (Tjalsma, H., A. Bolhuis, J. D. 
Jongbloed, S. Bron, and J. M. van Dljl. 2000: Signal peptide-dependent protein transport in 
Bacillus subtilis: a genome-based survey of the secretome. Microbiol. Mol. Biol. Rev. 64:515- 
10 547). The CssRS two-component regulatory system detects secretion stress by sensing the 
accumulation of misfolded proteins at the membrane-cell wall interface (Hyyrylainen. H. K., A. 
Bolhuis, E. Darmon, L. Muukkonen, P. Koski, M. Vitikainen, M. San/as, Z. Pragai, S. Bron, J. 
M. van Dijl, and V. P. Kontinen (2001): A novel two-component regulatory system of Bacillus 
subtilis for the survival of severe secretion stress. Mol. Microbiol. 41:1159-1172). The CssRS- 
15 inducing signal is not cytosolic misfolded proteins, since neither htrA nor htrB expression is 
induced by puromycin addition, which stops protein synthesis in the cell. The present 
observation that the expression of CssRS-controlled genes is responsive both to heat and 
secretion stress indicates that the CssRS system can sense misfolded proteins 
extracytosolically, irrespective of the cause that leads to their accumulation. The cssRS operon 
20 it self was shown to be transcriptionally induced by secretion stress caused by overproduction 
of a heterologous protein (Darmon et al. 2002). This was detected by an increase in 
reportergene activity in a host expressing a recombinant amylase and containing the cssRS 
operon promoter fused to the bgaB reporter. 

Comparison of the three secretion stress-inducible promoters, ykdA, yvtA and cssRS 
25 show that they all three contain repeated octameric motifs identical or close to TTTTCATA 
(SEQ ID NO.:3). It has been demonstrated that a point mutation in repeat I of the octameric 
consensus sequence affects heat and secretion stress induction of both the yvtAB and cssRS 
genes (Darmon et al. 2002). These data show that stress-induced expression of yvtA and 
cssRS are linked through this common regulatory sequence, perhaps to make the levels of 
30 protease (YkdA and yvtA) and regulator (CssR and CssS) responsive to the prevailing stress 
conditions. The CssRS system of Bacillus bears some resemblance to the CpxA-CpxR two- 
component system from E. coll. First, CpxA and CssS show amino acid sequence similarities, 
and the same is true for CpxR and CssR (Hyyrylainen et al. 2001). Second, these two systems 
control the transcription of genes encoding HtrA-like proteases: htrA (degP) of E. coli is 
35 regulated by the CpxAR system, and ykdA and yvtA of B. subtilis is regulated by the CssRS 
system. Finally, like the cpxAR operon, the transcription of the cssRS operon is autoregulated. 



10 
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Functional homologs are defined as proteins with similar functions. As an example 
homologs of HtrA proteases are proteins with similar functions, i.e. proteases induced by 
secretion stress and misfolded (aberrant) proteins. 

5 Yet in another interesting aspect of the invention the inducible promoter In its normal 

position is the promoter linked to a gene encoding a polypeptide which has at least 70%. 
preferably 80% or 90%. more preferably at least 95% or 98% identity to the amino acid 
sequence of SEQ ID NO.:2. The term "normal position" is this context to be understood as the 
occurrence of the promoter as it is found when operably linked to a protein not protein 

10 engineered. 

The degree of Identity between two amino acid sequences is determined by the Clustal 
method {Higgins. 1989. CABIOS 5: 151-153) using the LASERGENE™ MEGALIGN™ 
software (DNASTAR, Inc., Madison. Wl) with an identity table and the following multiple 
alignment parameters: Gap penalty of 10, and gap length penalty of 10. Pairwise alignment 
1 5 parameters are Ktuple=1 , gap penalty=3, windows=5, and diagonals=5. The degree of identity 
between two nucleotide sequences may be determined using the same algorithm and software 
package as described above with the following settings: Gap penalty of 10, and gap length 
penalty of 10. Pair wise alignment parameters are Ktuple=3, gap penalty=3 and windows=20. 

20 The method of the present invention may also used to identify new stress induced 

promoters by providing a host cell capable of secreting a protein and introduce a possible 
stress inducible promoter operably linked to a nucleic acid sequence encoding a reporter 
protein or a regulator protein into the host cell. By selecting the host cell exhibiting the desired 
level of reporter protein expression host cells containing stress inducible promoters may be 

25 identified and subsequently the stress inducible promoter may be isolated by techniques used 
in the art. 

Reporter protein. 

Reporter genes are nucleic acid sequences encoding easily assayed proteins (hereinafter 
30 reporter proteins). Reporter genes are frequently used as indicators of transcriptional activity or 
activation of particular signalling pathways within the cell. 

In the method of the present invention, the inducible promoter gene may be operably 
linked to a nucleic acid sequence encoding a reporter protein which is expressed when the 
inducible promoter is activated as described above. 
35 Alternatively, the expression of the reporter protein may be controlled by a regulator 

protein operably linked to the stress inducible promoter. A regulator protein is a protein that 
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10 



control the expression of a gene by interacting with a control site in DNA an influencing the 
initiation o f t ranscription. T he regulator g ene may a ct a s a n a ctivator, i .e. a ct a s a p ositive 
regulator of transcription or as a repressor, i.e. decrease the level of transcription. 

Measuring the amount of reporter protein expressed by the host cell obviously depends 
on the choice of reporter protein, but non-limiting examples are given below. 

Commonly used reporter proteins are chloramphenicol acetyltransferase, beta- 
galactosidase, beta-glucuronidase, aequorin. Green fluorescent protein. Red fluorescent 
protein, B lue fluorescent p rotein, Y ellow fluorescent p rotein, I uciferase, I ux, h erne, a ntibiotic 
markers, alkaline phosphatase, and beta-lactamase 



Nucleic Acid sequence. 

In the method of the present invention a nucleic acid sequence of interest may be 

obtained in various ways known in the art. Non-limiting examples are: isolation of wild type 

genes, generation of protein engineered variants, site directed mutagenesis, library screening. 
15 The host cell may comprise one or more. e.g. 2-15, particularly 2-10. more particularly 2-4. 

chromosomally i ntegrated copies of the n ucleic acid s equence of i nterest. The n ucleic a cid 

sequence of interest may be cloned on a plasmid and remain on the plasmid in the cell. 

As used herein the term "nucleic acid sequence" is intended to indicate any nucleic 

acid molecule of cDNA. genomic DNA. synthetic DNA or RNA origin. The term "sequence" is 
20 intended to indicate a nucleic acid segment which may be single- or double-stranded, and 

which may be based on a complete or partial nucleotide sequence encoding a polypeptide. 

The nucleic acid sequence of interest may suitably be of genomic or cDNA origin, for 

instance obtained by preparing a genomic or cDNA library and screening for DNA sequences 

coding for all or part of the polypeptide by hybridization using synthetic oligonucleotide probes 
25 in accordance with standard techniques (cf. Sambrook et al., supra). 

The nucleic acid sequence may also be prepared synthetically by established standard 

methods, e.g. the phosphoamidite method described by Beaucage and Caruthers. 

Tetrahedron Letters 22 (1981), 1859 - 1869, or the method described by Matthes et al., EMBO 

Journal 3 (1984), 801 - 805. According to the phosphoamidite method, oligonucleotides are 
30 synthesized, e.g. in an automatic DNA synthesizer, purified, annealed, ligated and cloned in 

suitable vectors. 

Furthermore, the nucleic acid sequence may be of mixed synthetic and genomic, mixed 
synthetic and cDNA or mixed genomic and cDNA origin prepared by ligating fragments of 
synthetic, genomic or cDNA origin (as appropriate), the fragments corresponding to various 
35 parts of the entire nucleic acid construct, in accordance with standard techniques. 

The nucleic acid sequence may also be prepared by polymerase chain reaction using 
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specific primers, for instance as described in US 4,683,202 or Saiki et aL, Science 239 (1988), 
487-491. 

The techniques used to isolate or clone a nucleic acid sequence encoding a 
polypeptide are known in the art and include isolation from genomic DNA, preparation from 

5 cDNA, or a combination thereof. The cloning of the nucleic acid sequences of the present 
invention from such genomic DNA can be effected, e.g.. by using the well known polymerase 
chain reaction (PCR) or antibody screening of expression libraries to detect cloned DNA 
fragments with shared structural features. See e.g. Innis et aL, 1990. A Guide to Methods and 
Application. Academic Press. New York. Other nucleic acid amplification procedures such as 

10 ligase chain reaction (LCR). ligated activated transcription (LAT) and nucleic acid sequence- 
based amplification (NASBA) may be used. The nucleic add sequence may be cloned from a 
strain producing the polypeptide, or from another related organism and thus, for example, may 
be an allelic or species variant of the polypeptide encoding region of the nucleic acid 
sequence. 

15 The term "isolated" nucleic acid sequence as used herein refers to a nucleic acid 

sequence which is essentially free of other nucleic acid sequences, e.g.. at least about 20% 
pure, preferably at least about 40% pure, more preferably about 60% pure, even more 
preferably about 80% pure, most preferably about 90% pure, and even most preferably about 
95% pure, as determined by agarose gel electorphoresis. For example, an isolated nucleic 

20 acid sequence can be obtained by standard cloning procedures used in genetic engineering to 
relocate the nucleic acid sequence from its natural location to a different site where it will be 
reproduced. The cloning procedures may involve excision and isolation of a desired nucleic 
acid fragment comprising the nucleic acid sequence encoding the polypeptide, insertion of the 
fragment into a vector molecule, and incorporation of the recombinant vector into a host cell 

25 where multiple copies or clones of the nucleic acid sequence will be replicated. The nucleic 
acid sequence may be of genomic, cDNA, RNA. semisynthetic, synthetic origin, or any 
combinations thereof. 



Nucleic acid sequence library 
30 Preparation of a nucleic acid sequence library can be achieved by use of known 

methods. 

Procedures for extracting genes from a cellular nucleotide source and preparing a gene 
library a re d escribed i n e.g. P itcher e t a I.. "Rapid extraction of b acterial g enomic D NA with 
guanidium thiocyanate", Lett. Appl. Microbiol., 8. pp 151-156, 1989. Dretzen, G. et aL. "A 
35 reliable method for the recovery of DNA fragments from agarose and acrylamide gels", Anal. 
Biochem., 112, pp 295-298, 1981, WO 94/19454 and Diderichsen et aL, "Cloning of aldB, 
which encodes alpha-acetolactate decarboxylase, an exoenzyme from Bacillus brevis", J. 
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Bacteriol., 172, pp 4315-4321, 1990. 

Procedures for preparing a gene library from an in vitro made synthetic nucleotide 
source can be found in (e.g. described by Stemmer, Proc. Natl. Acad. Sci. USA, 91, pp. 
10747-10751, 1994 or WO 95/17413). 
5 The library can also be screened as autonomically replicating plasmid library. 

Manipulating the nucleic acid sequences of a library 

In a particular embodiment the genes of a gene library may before, during or after 
initiating the screening be subjected to alterations and or mutations by genetic engineering. 
10 Generation of libraries of genes encoding variants of enzymes can be done i n a variety of 
ways: 

(1) Error prone PCR employs a low fidelity replication step to introduce random point 
mutations at each round of amplification (Caldwell and Joyce (1992). PCR Methods and 
Applications vol.2 (1), pp.28-33). Error-prone PCR mutagenesis is performed using a plasmid 

15 encoding the wild-type, i.e. wt, gene of interest as template to amplify this gene with flanking 
primers under PCR conditions where increased error rates leads to introduction of random 
point mutations. The PCR conditions utilized are typically: 10 mM Tris-HCI. pH 8.3, 50 mM 
KCI, 4 mM MgCI2, 0.3 mM MnCI2, 0.1 mM dGTP/dATP, 0.5 mM dTTP/dCTP, and 2.5 u Taq 
polymerase per 100 micro L of reaction. The resultant PCR fragment is purified on a gel and 

20 cloned using standard molecular biology techniques. 

(2) Oligonucleotide directed mutagenesis in single codon position (including deletions 
or insertions), e.g. by SOE-PCR is described by Kirchhoff and Desrosiers. PCR Methods and 
Applications, 1993, 2, 301-304. This method is performed as follows: Two independent PCR 
reactions are performed with 2 internal, overlapping primers, wherein one or both contain a 

25 mutant sequence and 2 external primers, which may encode restriction sites, thereby creating 
2 overlapping PCR fragments. These PCR fragments are purified, diluted, and mixed in molar 
ratio 1:1. The full length PCR product is subsequently obtained by PCR amplification with the 
external primers. The PCR fragment is purified on gel and cloned using standard molecular 
biology techniques. 

30 (3) Oligonucleotide directed randomization in single codon position, such as saturation 

mutagenesis, may be done e.g. by SOE-PCR as described above, but using primers with 
randomized nucleotides. For example NN(G/T), wherein N is any of the 4 bases G.A.T or C, 
will yield a mixture of codons encoding all possible amino acids. 

(4) Combinatorial site-directed mutagenesis libraries may be employed, where several 
35 codons can be mutated at once using (2) and (3) above. For multiple sites, several overlapping 

PCR fragments are assembled simultaneously in a SOE-PCR setup. 

(5) Another protocol employs synthetic gene libraries preparation. Wild type, i.e. wt, 
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genes can be assembled from multiple overlapping oligonucleotides (typically 40-100 
nucleotides in length; (Stemmer et al., (1995), Gene 164. 49-53). By including mixtures of wt 
and mutant variants of the same oligo at various positions in the gene, the resulting assembled 
gene will contain mutations at various positions with mutagenic rates corresponding to the 
5 ratios of wt to mutant primers. 

(6) Still another method employs multiple mutagenic primers to generate libraries with 
multiple mutated positions. First an uracil-containing nucleotide template encoding a 
polypeptide of interest is generated and 2-50 mutagenic primers corresponding to at least one 
region of identity in the nucleotide template are synthezised so that each mutagenic primer 

10 comprises at least one substitution of the template sequence (or: insertion/deletion of bases) 
resulting in at least one amino acid substitution (or insertion/deletion) of the amino acid 
sequence encoded by the uracil-containing nucleotide template. The mutagenic primers are 
then contacted with the uracil-containing nucleotide template under conditions wherein a 
mutagenic primer anneals to the template sequence. This is followed by extension of the 

15 primer(s) catalyzed by a polymerase to generate a mixture of mutagenized polynucleotides 
and uracil-containing templates. Finally, a host cell is transformed with the polynucleotide and 
template mixture wherein the template is degraded and the mutagenized polynucleotide 
replicated, generating a library of polynucleotide variants of the gene of interest. 

(7) Libraries may be created by shuffling e.g. by recombination of two or more wt genes 
20 or genes encoding variant proteins created by any combination of methods (1)-(6) (above) by 

DNA shuffling. 

In the method of the present invention, the nucleic acid sequence may be introduced 
into the host cell in the form of a nucleic acid construct. 

25 

Nucleic Acid Constructs 

The present invention also relates to nucleic acid constructs comprising a nucleic acid 
sequence of the present invention operably linked to one or more control sequences that direct 
the expression of the coding sequence in a suitable host cell under conditions compatible with 

30 the control sequences. 

A nucleic acid sequence encoding a polypeptide of the present invention may be 
manipulated in a variety of ways to provide for expression of the polypeptide. Manipulation of 
the nucleotide sequence prior to its insertion into a vector may be desirable or necessary 
depending on the expression vector. The techniques for modifying nucleotide sequences 
35 utilizing recombinant DNA methods are well known in the art. 

The control sequence may be an appropriate promoter sequence, a nucleotide 
sequence which is recognized by a host cell for expression of the nucleotide sequence. The 
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promoter sequence contains transcriptional control sequences, which mediate the expression 
of the polypeptide. The promoter may be any nucleotide sequence which shows 
transcriptional activity in the host cell of choice including mutant, truncated, and hybrid 
promoters, and may be obtained from genes encoding extracellular or intracellular 

5 polypeptides either homologous or heterologous to the host cell. 

Examples of suitable promoters for directing the transcription of the nucleic acid 
constructs of the present invention, especially in a bacterial host cell, are the promoters 
obtained from the £. coli lac operon, Streptomyces coelicolor agarase gene (dagA), Bacillus 
subtilis levansucrase gene (sacS), Bacillus licheniformis alpha-amylase gene {amyL), Bacillus 

10 stearothermophilus maltogenic amylase gene (arnyM), Bacillus amylollquefaciens alpha- 
amylase gene (amyQ\ Bacillus licheniformis penicillinase gene (penP), Bacillus subtilis xylA 
and xylB genes, and prokaryotic beta-lactamase gene (Villa-Kamaroff ef a/., 1978, 
Proceedings of the National Academy of Sciences USA 75: 3727-3731), as well as the tac 
promoter (DeBoer et ai f 1983, Proceedings of the National Academy of Sciences USA 80: 21- 

15 25). Further promoters are described in "Useful proteins from recombinant bacteria" in 
Scientific American, 1980, 242: 74-94; and in Sambrook et a/., 1989, supra. 

The control sequence may also be a signal peptide coding region that codes for an 
amino acid sequence linked to the amino terminus of a polypeptide and directs the encoded 
polypeptide into the cell's secretory pathway. The 5' end of the coding sequence of the 

20 nucleotide sequence may inherently contain a signal peptide coding region naturally linked in 
translation reading frame with the segment of the coding region which encodes the secreted 
polypeptide. Alternatively, the 5' end of the coding sequence may contain a signal peptide 
coding region which is foreign to the coding sequence. The foreign signal peptide coding 
region may be required where the coding sequence does not naturally contain a signal peptide 

25 coding region. Alternatively, the foreign signal peptide coding region may simply replace the 
natural signal peptide coding region in order to enhance secretion of the polypeptide. 
However, any signal peptide coding region which directs the expressed polypeptide into the 
secretory pathway of a host cell of choice may be used in the present invention. 

Effective signal peptide coding regions for bacterial host cells are the signal peptide 

30 coding regions obtained from the genes for Bacillus NCIB 11837 maltogenic amylase, Bacillus 
stearothermophilus alpha-amylase, Bacillus licheniformis subtilisin, Bacillus licheniformis 
alpha-amylase, Bacillus stearothermophilus neutral proteases (nprT % nprS t nprM), and Bacillus 
subtilis prsA. Further signal peptides are described by Simonen and Palva, 1993, 
Microbiological Reviews 57: 109-137. 

35 Where both signal peptide and propeptide regions are present at the amino terminus of 

a polypeptide, the propeptide region is positioned next to the amino terminus of a polypeptide 
and the signal peptide region is positioned next to the amino terminus of the propeptide region. 
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It may also be desirable to add regulatory sequences which allow the regulation of the 
expression of the polypeptide relative to the growth of the host cell. Examples of regulatory 
systems are those which cause the expression of the gene to be turned on or off in response 
to a chemical or physical stimulus, including the presence of a regulatory compound. 
5 Regulatory systems in prokaryotic systems include the lac, tec, and trp operator systems. 



Expression Vectors 

The present invention also relates to recombinant expression vectors comprising the 
nucleic acid construct of the invention. The various nucleotide and control sequences 

10 described above may be joined together to produce a recombinant expression vector which 
may include one or more convenient restriction sites to allow for insertion or substitution of the 
nucleotide sequence encoding the polypeptide at such sites. Alternatively, the nucleotide 
sequence of the present invention may be expressed by inserting the nucleotide sequence or a 
nucleic acid construct comprising the sequence into an appropriate vector for expression. In 

15 creating the expression vector, the coding sequence is located in the vector so that the coding 
sequence is operably linked with the appropriate control sequences for expression. 

The recombinant expression vector may be any vector (e.g., a plasmid or virus) which 
can be conveniently subjected to recombinant DNA procedures and can bring about the 
expression of the nucleotide sequence. The choice of the vector will typically depend on the 

20 compatibility of the vector with the host cell into which the vector is to be introduced. The 
vectors may be linear or closed circular plasmids. 

The vector may be an autonomously replicating vector, i.e., a vector which exists as an 
extrachromosomal entity, the replication of which is independent of chromosomal replication, 
e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial 

25 chromosome. 

The vector may contain any means for assuring self-replication. Alternatively, the 
vector may be one which, when introduced into the host cell, is integrated into the genome and 
replicated together with the chromosome(s) into which it has been integrated. Furthermore, a 
single vector or plasmid or two or more vectors or plasmids which together contain the total 
30 DNA to be introduced into the genome of the host cell, or a transposon may be used. 

The vectors of the present invention preferably contain one or more selectable markers 
which permit easy selection of transformed cells. A selectable marker is a gene the product of 
which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to 
auxotrophs, and the like. 

35 Examples o f b acterial s electable m arkers a re the d a/ g enes from 8 acillus s ubtilis o r 

Bacillus licheniformis, or markers which confer antibiotic resistance such as ampicillin, 
kanamycin, chloramphenicol or tetracycline resistance. 
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The vectors of the present invention preferably contain an element(s) that permits 
stable integration of the vector into the host cell's genome or autonomous replication of the 
vector in the cell independent of the genome. 

For integration into the host cell genome, the vector may rely on the nucleotide 
5 sequence encoding the polypeptide or any other element of the vector for stable integration of 
the vector into the genome by homologous or nonhomologous recombination. Alternatively, 
the vector may contain additional nucleotide sequences for directing integration by 
homologous recombination into the genome of the host cell. The additional nucleotide 
sequences enable the vector to be integrated into the host cell genome at a precise location(s) 

10 in the chromosome(s). To increase the likelihood of integration at a precise location, the 
integrational elements should preferably contain a sufficient number of nucleotides, such as 
100 to 1,500 base pairs, preferably 400 to 1,500 base pairs, and most preferably 800 to 1,500 
base pairs, which are highly homologous with the corresponding target sequence to enhance 
the probability of homologous recombination. The integrational elements may be any 

15 sequence that is homologous with the target sequence in the genome of the host celt. 
Furthermore, the integrational elements may be non-encoding or encoding nucleotide 
sequences. On the other hand, the vector may be integrated into the genome of the host cell 
by non-homologous recombination. 

For autonomous replication, the vector may further comprise an origin of replication 

20 enabling the vector to replicate autonomously in the host cell in question. Examples of 
bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19, 
pACYC177, and pACYC184 permitting replication in E co//, and pUB110, pE194. pTA1060 f 
and pAMfil permitting replication in Bacillus, Examples of origins of replication for use in a 
yeast host cell are the 2 micron origin of replication, ARS1, ARS4, the combination of ARS1 

25 and CEN3, and the combination of ARS4 and CEN6. The origin of replication may be one 
having a mutation which makes its functioning temperature-sensitive in the host cell (see, e.g., 
Ehrlich, 1978, Proceedings of the National Academy of Sciences USA 75: 1433). 

More than one copy of a nucleotide sequence of the present invention may be inserted 
into the host cell to increase production of the gene product. An increase in the copy number 

30 of the nucleotide sequence can be obtained by integrating at least one additional copy of the 
sequence into the host cell genome or by including an amplifiable selectable marker gene with 
the nucleotide sequence where cells containing amplified copies of the selectable marker 
gene, and thereby additional copies of the nucleotide sequence, can be selected for by 
cultivating the cells in the presence of the appropriate selectable agent. 

35 The procedures used to ligate the elements described above to construct the 

recombinant expression vectors of the present invention are well known to one skilled in the art 
(see, e.g., Sambrook etai, 1989 P supra). 
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Transformation 

The introduction of a vector into a bacterial host cell may. for instance, be effected by 
protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics 
5 168: 111-115), using competent cells (see, e.g., Young and Spizizin, 1961, Journal of 
Bacteriology 81: 823-829, or Dubnau and Davidoff-Abelson, 1971, Journal of Molecular 
Biology 56: 209-221), electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 
6: 742-751), or conjugation (see, e.g., Koehler and Thome. 1987, Journal of Bacteriology 169: 
5771-5278). 

10 

Enzymes. 

A particular embodiment of the present invention is the secretion of enzyme, where the 
enzyme may be selected from the group of enzymes comprising glycosyl hydrolases, 
carbohydrases. peroxidases, proteases, lipases, phytases, polysaccharide lyases, 
1 5 oxidoreductases, transglutaminases and glycoseisomerases, in particular the following. 

Parent proteases 

Parent proteases (i.e. enzymes classified under the Enzyme Classification number E.C. 
3.4 in accordance with the Recommendations (1992) of the International Union of Biochemistry 
20 and Molecular Biology (IUBMB)) include proteases within this group. 

Examples include proteases selected from those classified under the Enzyme Classification 
(E.C.) numbers: 

3.4.11 (i.e. so-called aminopeptidases), including 3.4.11.5 (Prolyl aminopeptidase), 
3.4.11.9 (X-pro aminopeptidase), 3.4.11.10 (Bacterial leucyl aminopeptidase), 3.4.11.12 
25 (Thermophilic aminopeptidase), 3.4.11.15 (Lysyl aminopeptidase), 3.4.11.17 (Tryptophanyl 
aminopeptidase), 3.4.11.18 (Methionyl aminopeptidase). 

3.4.21 (i.e. so-called serine endopeptidases), including 3.4.21.1 (Chymotrypsin), 
3.4.21.4 (Trypsin), 3.4.21.25 (Cucumisin), 3.4.21.32 (Brachyurin). 3.4.21.48 (Cerevisin) and 
3.4.21.62 (Subtilisln); 3.4.22 (i.e. so-called cysteine endopeptidases). including 3.4.22.2 
30 (Papain), 3.4.22.3 (Ftcain). 

3.4.22.6 (Chymopapain). 3.4.22.7 (Asclepain), 3.4.22.14 (Actinidain), 3.4.22.30 
(Caricain) and 3.4.22.31 (Ananain); 

3.4.23 (i.e. so-called aspartic endopeptidases), including 3.4.23.1 (Pepsin A), 3.4.23.18 
(Asperglllopepsin I), 3.4.23.20 (Penicillopepsin) and 3.4.23.25 (Saccharopepsin); and 
35 3.4.24 (i.e. so-called metalloendopeptidases), including 3.4.24.28 (Bacillolysin). 

Examples of relevant subtilisins comprise subtilisin BPN\ subtilisin amylosacchariticus, 
subtilisin 168, subtilisin mesentericopeptidase, subtilisin Carlsberg, subtilisin DY, subtilisin 309, 
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subtilisin 147, thermitase, aqualysin, Bacillus PB92 protease, proteinase K, Protease TW7, 
and Protease TW3. 

Specific examptes of such readily available commercial proteases include Esperase®, 
Alcalase®. Neutrase®. Dyrazym®, Savinase®, Pyrase®, Pancreatic Trypsin NOVO (PTN), 
5 Bio-Feed® Pro, Clear-Lens Pro ® (all enzymes available from Novozymes A/S). 

Examptes of other commercial proteases include Maxtase®, Maxacal®, Maxapem® 
marketed by Gist-Brocades N.V., Opticlean® marketed by Solvay et Cie. and Purafect® 
marketed by Genencor International. 

It is to be understood that also protease variants are contemplated as the parent 
10 protease. Examples of such protease variants are disclosed in EP 130.756 (Genentech), EP 
214.435 (Henkel), WO 87/04461 (Amgen), WO 87/05050 (Genex), EP 251.446 (Genencor), 
EP 260.105 (Genencor), Thomas et al.. (1985), Nature. 318, p. 375-376. Thomas et al.. 
(1987), J. Mol. Biol., 193, pp. 803-813. Russel et al.. (1987), Nature, 328, p. 496-500. WO 
88/08028 (Genex). WO 88/08033 (Amgen). WO 89/06279 (Novo Nordisk A/S), WO 91/00345 
15 (Novo Nordisk A/S). EP 525 610 (Solvay) and WO 94/02618 (Gist-Brocades N.V.). 

The activity of proteases can be determined as described in "Methods of Enzymatic 
Analysis", third edition. 1984. Verlag Chemie, Weinheim, vol. 5. 

Parent Lipases 

20 Parent lipases (i.e. enzymes classified under the Enzyme Classification number E.C. 

3.1.1 (Carboxylic Ester Hydrolases) in accordance with the Recommendations (1992) of the 
International Union of Biochemistry and Molecular Biology (IUBMB)) include lipases within this 
group. 

Examples include lipases selected from those classified under the Enzyme 
25 Classification (E.C.) numbers: 

3.1.1 (i.e. so-called Carboxylic Ester Hydrolases), including (3.1.1.3) Triacylglycerol 
lipases, (3.1.1.4.) Phosphorlipase A2. 

Examples of lipases include lipases derived from the following microorganisms: 
Humicola. e.g. H. brevispora, H. lanuginosa, H. brevis var. thermoidea and H. insolens (US 
30 4,810,414). 

Pseudomonas, e.g. Ps. fragi, Ps. stutzeri, Ps. cepacia and Ps. fluorescens (WO 
89/04361), or Ps. plantarii or Ps. gladioli (US patent no. 4,950,417 (Solvay enzymes)) or Ps. 
alcaligenes and Ps. pseudoalcaligenes (EP 218 272) or Ps. mendocina (WO 88/09367; US 
5,389.536). 

35 Fusarium, e.g. F. oxysporum (EP 1 30,064) or F. solani pisi(WO 90/09446). 

Mucor (also called Rhizomucor), e.g. M. miehei (EP 238 023). 
Chromobacterium (especially C. viscosum). Aspergillus (especially A. niger). 
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Candida, e.g. C. cylindracea (also called C. rugosa) or C. antarctica (WO 88/02775) or 
C. antarctica lipase A or B (WO 94/01541 and WO 89/02916). 
Geotricum, e.g. G. candidum (Schimada et al., (1989), J. Biochem., 106, 383-388). 
Penicillium, e.g. P. camembertii (Yamaguchi et al., (1991). Gene 103, 61-67). 
5 Rhizopus, e.g. R. delemar (Hass et al., (1991), Gene 109. 107-113) or R. niveus (Kugimiya et 
al., (1 992) Biosci.Biotech. Biochem 56. 716-719) or R. oryzae. 

Bacillus, e.g. B. subtilis (Dartois et al., (1993) Biochemica et Biophysica acta 1131 , 253-260) or 

B. stearothermophilus (JP 64/7744992) or B. pumilus (WO 91/16422). 

Specific examples of readily available commercial lipases include Lipolase®, Lipolase® 
1 0 Ultra, Lipozyme®, Palatase®, Novozym® 435, Lecitase® (all available from Novozymes MS). 

Examples of other lipases are Lumafast®, Ps. mendocian lipase from Genencor Int. Inc.; 

Lipomax®, Ps. pseudoalcaligenes lipase from Gist Brocades/Genencor Int. Inc.; Fusarium 

solani lipase (cutinase) from Unilever; Bacillus sp. lipase from Solvay enzymes. Other lipases 

are available from other companies. 
15 It is to be understood that also lipase variants are contemplated as the parent enzyme. 

Examples of such are described in e.g. WO 93/01285 and WO 95/22615. 

The activity of the lipase can be determined as described in "Methods of Enzymatic 

Analysis", Third Edition, 1984, Verlag Chemie, Weinhein, vol. 4, or as described in AF 95/5 GB 

(available on request from Novozymes A/S). 

20 

Parent Oxidoreductases 

Parent oxidoreductases (i.e. enzymes classified under the Enzyme Classification 
number E.C. 1 (Oxidoreductases) in accordance with the Recommendations (1992) of the 
International Union of Biochemistry and Molecular Biology (IUBMB)) include oxidoreductases 
25 within this group. 

Examples include oxidoreductases selected from those classified under the Enzyme 

Classification (E.C.) numbers: 

Glycerol-3-phosphate dehydrogenase _NAD+_ (1.1.1.8). Glycerol-3-phosphate 
dehydrogenase _NAD(P)+_ (1.1.1.94), Glycerol-3-phosphate 1 -dehydrogenase _NADP_ 

30 (1.1.1.94), Glucose oxidase (1.1.3.4), Hexose oxidase (1.1.3.5), Catechol oxidase (1.1.3.14), 
Bilirubin oxidase (1.3.3.5). Alanine dehydrogenase (1.4.1.1), Glutamate dehydrogenase 
(1.4.1.2), Glutamate dehydrogenase _NAD(P)+_ (1.4.1.3), Glutamate dehydrogenase 
lNADP+_ (1.4.1.4), L-Amino acid dehydrogenase (1.4.1.5), Serine dehydrogenase (1.4.1.7), 
Valine dehydrogenase _NADP+_ (1.4.1.8), Leucine dehydrogenase (1.4.1.9), Glycine 

35 dehydrogenase (1.4.1.10), L-Amino-acid oxidase (1 .4.3.2.), D-Amino-acid oxidase(1 .4.3.3), L- 
Glutamate oxidase (1.4.3.11), Protein-lysine 6-oxidase (1.4.3.13), L-lysine oxidase (1.4.3.14), 
L-Aspartate oxidase (1.4.3.16), D-amino-acid dehydrogenase (1.4.99.1), Protein disulfide 
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reductase (1.6.4.4), Thioredoxin reductase (1.6.4.5). Protein disulfide reductase (glutathione) 
(1.8.4.2), Laccase (1.10.3.2), Catalase (1.11.1.6). Peroxidase (1.11.1.7). Lipoxygenase 
(1.13.11.12). Superoxide dismutase (1.15.1.1) 

Said Glucose oxidases may be derived from Aspergillus niger. Said Laccases may be 
5 derived f rom P olyporus p insitus. M yceliophtora t hermophila, C oprinus c inereus. R hizoctonia 
solani, Rhizoctonia praticola, Scytalidium thermophilum and Rhus vemicifera. Bilirubin 
oxidases may be derived from Myrothechecium verrucaria. The Peroxidase may be derived 
from e.g. Soy bean, Horseradish or Coprinus cinereus. The Protein Disulfide reductases 
Protein Disulfide reductases of bovine origin, Protein Disulfide reductases derived from 
1 0 Aspergillus oryzae or Aspergillus niger, and DsbA or DsbC derived from Escherichia coli . 

Specific examples of readily available commercial oxidoreductases include Gluzyme 
(enzyme available from Novozymes A/S). However, other oxidoreductases are available from 
others. 

It is to be understood that also variants of oxidoreductases are contemplated as the 
15 parent enzyme. 

The activity of oxidoreductases can be determined as described in "Methods of 
Enzymatic Analysis", third edition. 1984. Verlag Chemie, Weinheim, vol. 3. 

Parent Carbohydrases 

20 Parent carbohydrases may be defined as all enzymes capable of breaking down 

carbohydrate chains (e.g. starches) of especially five and six member ring structures (i.e. 
enzymes classified under the Enzyme Classification number E.C. 3.2 (glycosidases) in 
accordance with the Recommendations (1992) of the International Union of Biochemistry and 
Molecular Biology (IUBMB)). 

25 Examples include carbohydrases selected from those classified under the Enzyme 

Classification (E.C.) numbers: 

alfa-amylase (3.2.1.1) alfa-amylase (3.2.1.2), glucan 1,4-alfa-glucosidase (3.2.1.3). cellulase 
(3.2.1.4). endo-1.3(4)-beta-glucanase (3.2.1.6), endo-1 ,4-beta-xylanase (3.2.1.8). dextranase 
(3.2.1.11), chitinase (3.2.1.14). polygalacturonase (3.2.1.15). lysozyme (3.2.1.17), beta- 

30 glucosidase (3.2.1.21), alfa-galactosidase (3.2.1.22). beta-galactosidase (3.2.1.23). amylo- 
1,6-glucosidase (3.2.1.33), xylan 1 ,4-beta-xylosidase (3.2.1.37), glucan endo-1 ,3-beta-D- 
glucosidase (3.2.1.39), alfa-dextrin endo-1 ,6-glucosidase (3.2.1.41). sucrose alfa-glucosidase 
(3.2.1.48), glucan endo-1 ,3-alfa-glucosidase (3.2.1.59), glucan 1,4-beta-glucosidase 
(3.2.1.74). glucan endo-1 ,6-beta-glucosidase (3.2.1.75), arabinan endo-1. 5-alfa-arablnosidase 

35 (3.2.1.99). lactase (3.2.1.108), and chitonanase (3.2.1.132). 

Specific examples of readily available commercial carbohydrases include Alpha-Gal®, 
Bio-Feed® Alpha, Bio-Feed® Beta, Bio-Feed® Plus. Bio-Feed® Plus, Novozyme® 188, 
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Carezyme®, Celluclast®, Cellusoft®. Ceremyl®, Citrozym®, Denimax®, Dezyme®, 
Dextrozyme®, Finizym®, Fungamyl®, Gamanase®, Glucanex®, Lactozym®, Maltogenase®, 
Pentopan®, Pectinex®. Promozyme®. Pulpzyme®, Novamyl®, Termamyl®. AMG 
(Amyloglucosidase Novo), Maltogenase®, Aquazym®. Natalase® (all enzymes available from 
5 Novozymes A/S). Other carbohydrases are available from other companies. 

It is to be understood that also carbohydrase variants are contemplated as the parent 
enzyme. 

The activity of carbohydrases can be determined as described in "Methods of 
Enzymatic Analysis", third edition. 1984, Verlag Chemie, Weinheim, vol. 4. 

10 

Parent Transferases 

Parent transferases (i.e. enzymes classified under the Enzyme Classification number 
E.C. 2 in accordance with the Recommendations (1992) of the International Union of 
Biochemistry and Molecular Biology (IUBMB)) include transferases within this group. 

15 The parent transferases may be any transferase in the subgroups of transferases: 

transferases transferring one-carbon groups (E.C. 2.1); transferases transferring aldehyde or 
residues (E.C 2.2); acyltransferases (E.C. 2.3); glucosyltransferases (E.C. 2.4); transferases 
transferring alkyl or aryl groups, other that methyl groups (E.C. 2.5); transferases transferring 
nitrogeneous groups (2.6). 

20 In a preferred embodiment the parent transferase is a transglutaminase E.C 2.3.2.13 

(Protein-glutamine beta-glutamyltransferase). 

Transglutaminases are enzymes capable of catalyzing an acyl transfer reaction in 
which a gamma-carboxyamide group of a peptide-bound glutamine residue is the acyl donor. 
Primary amino groups in a variety of compounds may function as acyl acceptors with the 
25 subsequent formation of monosubstituted gamma-amides of peptide-bound glutamic acid. 
When the epsilon-amino group of a lysine residue in a peptide-chain serves as the acyl 
acceptor, the transferases form intramolecular or intermolecular gamma-glutamyl-epsilon-lysyl 
crosslinks. 

The parent transglutaminase may be of human, animal (e.g. bovine) or microbial origin. 

30 Examples of such parent transglutaminases are animal derived Transglutaminase, 

FXIIIa; microbial transglutaminases derived from Physarum polycephalum (Klein et al., Journal 
of Bacteriology, Vol. 174, p. 2599-2605); transglutaminases derived from Streptomyces sp., 
including Streptomyces lavendulae, Streptomyces lydicus (former Streptomyces libani) and 
Streptoverticiilium sp., including Streptoverticillium mobaraense. Streptoverticillium 

35 cinnamoneum, and Streptoverticillium griseocameum (Motoki et al., US 5,156,956; Andou et 
al., US 5,252,469; Kaempfer et al., Journal of General Microbiology, Vol. 137, p. 1831-1892; 
Ochi et al., International Journal of Sytematic Bacteriology, Vol. 44, p. 285-292; Andou et al., 
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US 5,252,469; Williams et al.. Journal of General Microbiology, Vol. 129, p. 1743-1813). 

It i s t o b e u nderstood that a Iso transferase v ariants a re c ontemplated as t he p arent 
enzyme. 

The activity of transglutaminases can be determined as described in "Methods of 
5 Enzymatic Analysis", third edition. 1984. Verlag Chemie. Weinheim, vol. 1-10. 

Parent Phytases 

Parent p hytases a re i ncluded i n t he g roup of e nzymes c lassified u nder t he E nzyme 
Classification number E.C. 3.1.3 (Phosphoric Monoester Hydrolases) in accordance with the 
10 Recommendations (1992) of the I nternational U nion of Biochemistry and Molecular Biology 
(IUBMB)). 

Phytases are enzymes produced by microorganisms, which catalyse the conversion of 
phytate to inositol and inorganic phosphorus. 

Phytase producing microorganisms comprise bacteria such as Bacillus subtilis, Bacillus 
15 natto and Pseudomonas; yeasts such as Saccharomyces cerevisiae; and fungi such as 
Aspergillus niger, Aspergillus ficuum, Aspergillus awamori, Aspergillus oryzae, Aspergillus 
terreus or Aspergillus nidulans, and various other Aspergillus species). 

Examples of parent phytases include phytases selected from those classified under the 
Enzyme Classification (E.C.) numbers: 3-phytase (3.1.3.8) and 6-phytase (3.1.3.26). 
20 The activity of phytases can be determined as described in "Methods of Enzymatic 

Analysis", third edition, 1984, Verlag Chemie, Weinheim, vol. 1-10, or may be measured 
according to the method described in EP-A1-0 420 358, Example 2 A. 

Lyases 

25 Suitable lyases include Polysaccharide lyases: Pectate lyases (4.2.2.2) and pectin 

lyases (4.2.2.10). such as those from Bacillus licheniformis disclosed in WO 99/27083. 

Isomerases 

Without being limited thereto suitable protein disulfide isomerases (PDI) include PDIs 
30 described in WO 95/01425 (Novo Nordisk A/S) and suitable glucose isomerases include those 
. described in Biotechnology Letter, Vol. 20. No 6, June 1998, pp. 553-56. 

Contemplated isomerases include xylose/glucose Isomerase (5.3.1.5) including 
SweetzymeD (available from Novozymes A/S). 

35 Antimicrobial peptides. 

In another particular embodiment of the present invention, the secreted protein is an 
anti microbial peptide (AMP). In the context of the present invention AMPs are polypeptides or 
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proteins showing evidence of antimicrobial activity. 

The term "antimicrobial activity" is defined herein as an activity which is capable of 
killing or inhibiting growth of microbial cells. In the context of the present invention the term 
"antimicrobial" is Intended to mean that there is a bactericidal and/or a bacteriostatic and/or 
5 fungicidal and/or fungistatic effect and/or a virucidal effect, wherein the term "bactericidal" is to 
be understood as capable of killing bacterial cells. The term "bacteriostatic" is to be understood 
as capable of inhibiting bacterial growth, i.e. inhibiting growing bacterial cells. The term 
"fungicidal" is to be understood as capable of killing fungal cells. The term fungistatic" is to be 
understood as capable of inhibiting fungal growth, i.e. inhibiting growing fungal cells. The term 
10 "virucidal" is to be understood as capable of inactivating virus. The term "microbial cells" 
denotes bacterial or fungal cells (including yeasts). 

In the context of the present invention the term "inhibiting growth of microbial cells" is 
intended to mean that the cells are in the non-growing state, i.e., that they are not able to 
propagate. 

15 For purposes of the present invention, antimicrobial activity may be determined 

according to the procedure described by Lehrer et at., Journal of Immunological Methods, Vol. 
137 (2) pp. 167-174 (1991). 

Polypeptides having antimicrobial activity may be capable of reducing the number of 
living cells of a microbe selected from the group consisting of Aspergillus fumigatus (CBS 

20 113.26), Candida albicans (ATCC 10231), Saccharomyces cerevisiae (ATCC 9763), 
Trychophyton mentagrophytes (DSM 4870), Pityrosporum (CBS 1878), Epidermophyton 
floccosum (DSM 10709), Aspergillus niger (ATCC 9642) and Fusarium Oxysporum to 1/100 
after 30 min. incubation at 20°C in an aqueous solution of 25%(w/w); preferably in an aqueous 
solution of 10%(w/w); more preferably in an aqueous solution of 5%(w/w); even more 

25 preferably in an aqueous solution of 1%(w/w); most preferably in an aqueous solution of 
0.5%(w/w); and in particular in an aqueous solution of 0.1%(w/w). 

Polypeptides having antimicrobial activity may also be capable of inhibiting the 
outgrowth of a microbe selected from the group consisting of Aspergillus fumigatus (CBS 
113.26), Candida albicans (ATCC 10231), Saccharomyces cerevisiae (ATCC 9763), 

30 Trychophyton mentagrophytes (DSM 4870), Pityrosporum (CBS 1878), Epidermophyton 
floccosum (DSM 10709), Aspergillus niger (ATCC 9642) and Fusarium Oxysporum for 24 
hours at 25*C in a microbial growth substrate, when added in a concentration of 1000 ppm; 
preferably when added in a concentration of 500 ppm; more preferably when added in a 
concentration of 250 ppm; even more preferably when added in a concentration of 100 ppm; 

35 most preferably when added in a concentration of 50 ppm; and in particular when added in a 
concentration of 25 ppm. 



25 



10355.000-DK 



An AMP of the present invention may be obtained from microorganisms of any genus. 
For purposes of the present invention, the term "obtained from" as used herein shall mean that 
the polypeptide encoded by the nucleotide sequence is produced by a cell in which the 
nucleotide sequence is naturally present or into which the nucleotide sequence has been 
5 inserted. In a preferred embodiment, the polypeptide is secreted extracellularly. 

An AMP of the present invention may be a bacterial polypeptide. For example, the 
polypeptide may be a gram positive bacterial polypeptide such as a Bacillus polypeptide, e.g.. 
a Bacillus alkalophifus, Bacillus amyloliquefaciens. Bacillus brevis, Bacillus circulans, Bacillus 
coaguians, Bacillus lautus, Bacillus lentus. Bacillus licheniformis, Bacillus megaterium, Bacillus 
10 stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis polypeptide; or a Streptomyces 
polypeptide, e.g., a Streptomyces lividans or Streptomyces m urinus polypeptide; or a gram 
negative bacterial polypeptide, e.g., an E. coli or a Pseudomonas sp. polypeptide. 

An AMP of the present invention may be a fungal polypeptide, and more preferably a 
yeast polypeptide such as a Candida, Kluyveromyces, Pichia, Saccharomyces, 
15 Schizosaccharomyces, or Yarrowia polypeptide; or more preferably a filamentous fungal 
polypeptide such as an Acremonium, Aspergillus, Aureobasidium, Cryptococcus, Filibasidium, 
^Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimasiix, Neurospora, 
Paecilomyces, Penicillium, Piromyces, Schizophyllum, Talaromyces, Thermoascus, Thielavia, 
Tolypocladium, or Trichoderma polypeptide. 
20 In an interesting embodiment, the polypeptide is a Saccharomyces carlsbergensis, 

Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, 
Saccharomyces kluyveri, Saccharomyces norbensis or Saccharomyces oviformis polypeptide. 

In another interesting embodiment, the polypeptide is an Aspergillus aculeatus, 
Aspergillus awamori, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, 
25 Aspergillus niger, Aspergillus oryzae, Fusarium bactridioides, Fusarium cerealis, Fusarium 
crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium 
heterosporum, Fusarium negundi. Fusarium oxysporum, Fusarium reticulatum, Fusarium 
roseum, Fusarium sambucinum, Fusarium sarcochmum, Fusarium sporotrichioides, Fusarium 
sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola 
30 insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora 
crassa, Penicillium purpurogenum, Trichoderma harzianum, Trichoderma koningii, 
Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride polypeptide. 

It will be understood that for the aforementioned species, the invention encompasses 
both the perfect and imperfect states, and other taxonomic equivalents, e.g., anamorphs, 
35 regardless of the species name by which they are known. Those skilled in the art will readily 
recognize the identity of appropriate equivalents. 
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Strains of these species are readily accessible to the public in a number of culture 
collections, such as the American Type Culture Collection (ATCC), Deutsche Sammlung von 
Mikroorganismen und Zellkulturen GmbH (DSM), Centraalbureau Voor Schimmelcultures 
(CBS), and Agricultural Research Service Patent Culture Collection, Northern Regional 
5 Research Center (NRRL). 

AMPs encoded by nucleotide sequences of the present invention also include fused 
polypeptides or cleavable fusion polypeptides in which another polypeptide is fused at the N- 
terminus or the C-terminus of the polypeptide or fragment thereof. A fused polypeptide is 
produced by fusing a nucleotide sequence (or a portion thereof) encoding another polypeptide 
10 to a nucleotide sequence (or a portion thereof) of the present invention. 

Methods of production . 

The transformed or transfected host cells described above are cultured in a suitable 

nutrient medium under conditions permitting the production of the desired molecules, after 
1 5 which these are recovered from the cells, or the culture broth. 

The medium used to culture the cells may be any conventional medium suitable for 

growing the host cells, such as minimal or complex media containing appropriate supplements. 

Suitable media are available from commercial suppliers or may be prepared according to 

published recipes (e.g. in catalogues of the American Type Culture Collection). The media are 
20 prepared using procedures known in the art (see, e.g., references for bacteria and yeast; 

Bennett, J.W. and LaSure, L, editors, More Gene Manipulations in Fungi, Academic Press, 

CA, 1991). 

The cells may be cultured in any suitable container-unit, e.g. a shake flask, 24 well 
plates, 96 well plates, 384 well plates, 1536 well plates, or a higher number of wells per plate, 
25 or nanoiiter well-less compartments. 

In order to increase the number of individual activity assays performed in a given time 
the activity may conveniently be assayed in a high-throughput screening system using 96 well 
plates, 384 well plates, 1536 well plates, or a higher number of wells per plate, or nanoiiter 
well-less compartments. Such screening techniques are well known in the art, see e.g. Dove. 
30 A., Nature Biotechnology (17), 1999, 859-863, and Kell, D„ trends in Biotechnology (17), 1999, 
89-91. 

The molecules are recovered from the culture medium by conventional procedures 
including separating the host cells from the medium by centrifugation or filtration, precipitating 
the proteinaceous components of the supernatant or filtrate by means of a salt, e.g. 
35 ammonium sulphate, purification by a variety of chromatographic procedures, e.g. ion 
exchange chromatography, gelfiltration chromatography, affinity chromatography, or the like, 
dependent on the type of molecule in question. 
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The molecules of interest may be detected using methods known in the art that are 
specific for the molecules. These detection methods may include use of specific antibodies, 
formation of a product, or disappearance of a substrate. For example, an enzyme assay may 
be used to determine the activity of the molecule. Procedures for determining various kinds of 

5 activity are known in the art. 

The molecules of the present invention may be purified by a variety of procedures 
known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, 
hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., 
preparative isoelectric focusing (IEF), differential solubility (e.g., ammonium sulfate 

1 0 precipitation), or extraction (see, e.g., Protein Purification, J-C Janson and Lars Ryden, editors. 
VCH Publishers, New York, 1989). 

Relenting recombinant host cells. 

Bacillus transformations are incubated and those exhibiting desired level of reporter 

1 5 gene activity are selected. 

As an example the reporter gene may be 2-fold over expressed in a secretion stressed 
celt compared to a non secretion stressed cell, preferably 5-fold over expressed in a secretion 
stressed cell compared to a non secretion stressed cell, more preferably 10-fold over 
expressed in a secretion stressed cell compared to a non secretion stressed cell, or 20-fold 

20 over expressed in a secretion stressed cell compared to a non secretion stressed cell, most 
preferably 50-fold over expressed in a secretion stressed cell compared to a non secretion 
stressed cell, or more than 100-fold over expressed in a secretion stressed cell compared to a 
non secretion stressed cell. 

25 Material and methods 

Bacterial strains and growth conditions 

The 8ac///us subtilis strain DN3 (Noone et al. 2000) was used in this study. 
Ithas the genotype: trpC2P spac-ykdA P^-tecZ Er/. B. subtiliswere routinely maintained 
and propagated on Luria-Bertani (LB) or supplemented with agar (1.5% wt/voi) as appropriate 

30 and grown at 37°C with aeration. X-Gal (5-bromo-4-chloro-3-indolyl-b-D-galactopyranoside) 
Was added to the media at a concentration of 64ng/ml. and IPTG (isopropyl-b-D- 
thiogalactopyranoside) was added to a final concentration of 0,8 mM. Antibiotics were added at 
the following concentrations: chloramphenicol, 6 ug/ml; and erythromycin, 3 ug/ml. For 
expression studies transformants were grown in PS-1 media for 3 days, 30° C and at 250 rpm, 

35 cells were spun down and the supernatant analysed for secreted recombinant protein on SDS- 
polyacrylamide gels. 
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DNA manipulations 

B. subtilis transformations were performed as described previously (Anagnostopotous, 
C, and J. S pizizen. 1 961. R equirements for t ransformation i n S acillus s ubtilis. J . B acteriol. 
81:741-746). All routine molecular biological procedures were performed according to the 
5 protocols described by Sambrook et al. (1989). 

SDS-Dolvacrvlamide gel electrophoresis 

Equal volume of Laemmli buffer and supernatants from liquid cultures of transformants 
were mixed, and analyzed on SDS-polyacrylamide gels (12%) according to Laemmli (Laemmli, 
10 U.K. (1970) Nature 227, 680-685) 

Screening for reporter aene expression 

Bacillus transformations were plated on Petri dishes with LB-media supplemented with 
agar (1.5% wt/vol) and containing the appropriate antibiotics, X-Gal and IPTG at the above 
15 described concentrations. They were incubated at 37° C overnighL Blue colonies 
(reportergene activity) could be seen either immediately or after up to 24 hours at room 
temperature. 

Expression constructs 

20 Expression constructs were made in either the expression vector pDG268neo (Widner 

B; Thomas M; Sternberg D; Lammon D; Behr R; Sloma A (2000): Development of marker-free 
strains of Bacillus subtilis capable of secreting high levels of industrial enzymes. Journal of 
Industrial Microbiology and Biotechnology, Vol. 25 (4) pp. 204-212) or in a linear integration 
vector. In both ways the final gene construct is integrated on the Bacillus chromosome by 

25 homologous recombination into either the AmyE locus or the pectate lyase locus. Cloning in 
the plasmid pDG268neo was done according to the protocols described by Sambrook et al. 
(1989). The linear integration vector is a PCR fusion product made by fusion of the gene of 
interest between two Bacillus subtils homologous chromosomal regions along with a strong 
promoter and a chloramphenicol resistance marker. The fusion is made by SOE PCR (Horton, 

30 R.M., Hunt, H.D., Ho, S.N., Pullen, J.K. and Pease, L.R. (1989) Engineering hybrid genes 
without the use of restriction enzymes, gene splicing by overlap extension Gene 77: 61-68). 
The promoter consists of the amyL promoter P 41 99 and the amyQ promoter scBAN/cry3A 
long s tabilizer ( the m ethod i s d escribed i n p atent a pplication WO 0 3/00301 ). The c onstruct 
NP000719 was constructed in the linear integration vector: First 3 fragments were PCR 

35 amplified: the gene fragment with specific primers oth48 (SEQ ID NO.:4) and oth50 (SEQ ID 
NO.:5) on genomic DNA from the strain harboring the gene (strain NN01856). The upstream 
flanking fragment was amplified with the primers 260558 (SEQ ID NO.: 6) and oth49 (SEQ ID 
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NO.: 7) and the downstream flanking fragment was amplified with the primers 260559 (SEQ ID 
NO.: 8) and oth51 (SEQ ID NO.: 9) from genomic DNA from the strain iMB1361 (described in 
patent application WO 03/00301). The 3 resulting fragments were mixed in equal molar ratios 
(fragment 1: 400ng, fragment 2: 100ng, fragment 3: 200ng) and a new PCR reaction were run 

5 under the following conditions: initial 2 min at 94°C. followed by 10 cycles of (94"C for 15 sec, 
55°C for 45 sec, 68°C for 5 min.), 5 cycles of (94°C for 15 sec, 55°C for 45 sec, 68°C for 8 
min.), 15 cycles of (94°C for 15 sec, 55°C for 45 sec, 68°C for 8 min. in addition 20 sec. extra 
pr cycle). Two micro L of the PCR product was transformed into the Bacillus WT strain and into 
DN3 and selection was performed as described. The other constructs listed in table 1 are 

10 made in an identical way; the only difference is using other gene specific primers. 

Strain construction 

Strain DN3 was constructed by cloning a ykdA fragment (synthesized with primers 
YKDA6 (SEQ ID NO.: 10) and YKDAP1 (SEQ ID NO.: 11) into pMUTin4 (Vagner V, Dervyn E 

15 and SD Ehrlich (1998) A vector for systematic gene inactivation in Bacillus subtilis. 
Microbiology, Vol 144, 3097-3104) to generate plasmid pDN3. Plasmid pDN3 was then 
integrated into the ykdA gene on the chromosome, of B. subtilis strain 168 by a single 
crossover (Campbell-type event) to yield strain DN3. This integration results in (1) that lacZ 
becomes transcriptionally fused to the ykdA promoter, allowing its expression to be monitored 

20 (2) the native ykdA gene gets under the IPTG inducible Pspac promoter control, so the 
transcription of ykdA can be controlled by IPTG. 

Results 

DN3 was used as host for transformation of 1 2 different expression constructs. The 
25 constructs had previously been analysed in a WT bacillus host, without the possibility for 
secretion selection. For 5 of the constructs it was previously possible to find transformants 
secreting recombinant protein (from 200mg to 2g /L) in the WT host. For 5 of the constructs 
secreted recombinant protein could not be detected among selected transformants in the WT 
host. Two of the constructs were never previously analysed for recombinant protein, as 
30 transformants containing a non mutated gene was never obtained. In table 1 the number of 
white and blue colonies from each transformation is scored from the experiment using DN3 as 
host. Results from the experiment using the WT host are listed as well. The number of blue 
and white colonies reflects the result obtained with the WT host transformants: The constructs 
giving rise to a majority of blue colonies, were also successfully expressed in the WT (giving a 
35 band on SDS gel and containing no mutations). The constructs giving rise to a majority of 
white colonies, were unsuccessfully expressed in the WT host. 
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Five constructs giving rise to protein bands on SDS gels (from 200mg to 2 g/l) gave 90- 
99% blue colonies upon transformation of constructs into the host DN3. Five constructs for 
which it was not possible to find transformants in the WT host producing recombinant protein 
bands, gave 90-99% white colonies upon transformation into the DN3 host. Controls were 
5 transformed with water instead of DNA and this resulted in only white colonies. 

Colonies from several of the constructs have been analyzed further. For 3 of the five 
constructs that were successfully expressed in the WT (giving recombinant protein band on 
SDS g el) a nd giving 9 5-99% b lue colonies i n D N3. b lue a nd w hite c olonies w ere a nalyzed 

10 further. Blue and white colonies were selected for growth in liquid culture and the culture 
supernatants were analyzed for recombinant proteins on SDS gels. All 3 constructs where 
shown to produce recombinant protein of expected size from the blue colonies, but no 
recombinant protein bands were observed from the white colonies. 

Six of the seven construct that were not successfully expressed in the WT and gave 

15 rise to only few blue colonies in DN3 (1-30% blue colonies) were also analyzed further. For 4 
of the 6 constructs it was possible to find transformants giving recombinant protein bands on 
SDS gels among the few blue colonies. 

Table 1. List of constructs that were transformed into a WT host and the DN3 host. 
20 Transformants were analysed for secreted protein in supernatants from liquid cultures by SDS 
gel analysis. The % of blue and white colonies was scored in the DN3 host. In the WT 
experiment, the number of transformants with the right insert of the total number of 
transformants analysed is listed. 



Construct/ 
Enzyme 


Expression In WT-host | 


Expression in DN3 




analysis of 

transformants 

insert/analysed 


band on 
SDS-gel? 


% blue / 

white 

colonies 


Analysis of blue 
colonies 
band on 
SDS-gel? 


Carboxypeptidase 


10/10 


no ! 


1 /99 


No band 

(1 blue colony 
analysed) 


Alginate lyase 




no 


5/95 


Yes 

300mg/L j 
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10 



Protease 


1/10 


no 


5/95 


Ves I 
500mg/L I 


Lipolytic enzyme 




no 


1 /99 


No band I 
(3 light blue colonies! 
analysed) 


Endopeptidase 


9/10 


no 


10/90 


Not analysed | 


Isomaitodextranas 
e 


3/60 


not tested 


10/90 


Yes l 

500mg/L I 


Xylosidase 


12/77 


not tested 


30/70 


Yes "| 

50-1 OOmg | 


XylanaseGHH 


9/10 


yes 
1-2g/l 


90/10 


Not analysed 1 


Metalloprotease 


7/10 


yes 

300mg/l 


99/1 


Yes 1 


Xylanase GH11 


9/10 


yes 

100 mg /I 


95/5 


Yes 1 


Xylanase GH10 


8/10 


yes 
1-2 g/I 


95/5 


Not analysed 1 


amylase 




yes 
1g/l 


95/5 


Yes 1 

1 1 


Water 






0/100 





12 different constructs, including amylases, xylanases, lipases, alginate lyase, 
dextranase and proteases, expressing different levels of product between 200 mg and 2 
gram/liter, have been assessed. In all the cases the blue/white selection system correlates with 
the expression potential of the constructs (blue colonies are secreting Bacillus transformants. 
White colonies are non secreting Bacillus transformants). 

In the traditional expression cloning without secretion selection it is very time 
consuming to identify the transformants expressing and secreting the protein of interest. This 
includes analysis of transformants for the right insert by genome analysis or plasmid analysis 
of normally 5-20 transformants. But in some cases in table 1 we have analysed more than 70 
transformants to find few with the right insert. Liquid cultures of selected transformants are 
fermented, for protein analysis of the supernatant on SDS-gels. This step is quite labour 
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intensive and expensive, so often only a few clones are selected for this analysis. These are 
sometimes non expressing clones and in the traditional expression cloning there is no way to 
select the f. ex 5-10% secreting transformants from the 90 to 95%. The advantage of the 
secretion selection is that in these cases the 5-10 % expressing and secreting transformants 
5 can easily be identified by their blue colony colour on solid media. 
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CLAIMS 

1. A method of screening for protein secreting recombinant host cells comprising 
screening for promoter activity of a stress inducible promoter, 

5 

2. The method according to claim 1 comprising the steps of 

(vi) Providing a host cell comprising the stress inducible promoter operably 
linked to nucleic acid sequence encoding a reporter protein or a regulator 
protein. 

10 (vii)Providing a nucleic acid sequence of interest. 

(viii) Introducing the nucleic acid sequence in (ii) into the host cell in (i) 

<ix) Culturing host ceil obtained in (iii) under conditions promoting secretion of 

the protein encoded by the nucleic acid sequence from (ii); and 
(x) Selecting the host cell exhibiting the desired level of reporter protein 

15 expression. 



3. The method according to claim 2, wherein the regulator protein controls the expression 
of the reporter gene. 

20 4. The method according to claim 3, wherein the regulator protein is an activator or 
repressor of the expression of the reporter protein. 

5. The method according to claim 1 , where the host cell is selected from bacterial cells. 

25 6. The method according to claim 5, where the host cells belong to a strain selected from 
the group consisting of the species Bacillus alkalophilus. Bacillus agaradhaerens, 
Bacillus amyloliquefaciens, Bacillus brevis, Bacillus clausii, Bacillus circulans. Bacillus 
coagulans, Bacillus lautus, Bacillus lentus. Bacillus licheniformis, Bacillus megaterium, 
Bacillus stearothermophilus, Bacillus subtilis, Bacillus thuringiensis, Streptomyces 

30 lividans, Streptomyces murinus, Escherichia coli, Lactococcus lactis, and 

Pseudomonas putida. 



- 7. The method according to claim 1, where the stress inducible promoter is comprised by 
the nucleic acids 1-999 of SEQ ID NO.:1. 

35 

8. The method according to claim 1, where the stress inducible promoter comprises the 
nucleic acids 1-999 of SEQ ID NO.:1. 
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9. The method according to claim 1, where the stress inducible promoter consists of the 
nucleic acids 1-999 of SEQ ID NO.:1 . 

10. The method a ccording to claim 1 , where the stress i nducibie p romoter i n its normal 
position regulates a gene encoded protein that is a functional homolog of the gene 
encoded protein regulated by the promoter sequence comprised by nucleic acids 1-999 
of SEQ ID NO.:1. 

1 1 . The method a ccording t o c laim 1 . w here t he s tress i nducibie p romoter i n i ts n ormal 
position is the promoter linked to a gene encoding a polypeptide which has at least 
70% identity to the amino acid sequence of SEQ ID NO.:Z 

12. The method according to claim 1, where the stress inducible promoter is the promoter 
linked to a gene encoding a polypeptide which has at least 80% identity to the amino 
acid sequence of SEQ ID NO.:2, or at least 90% identity to the amino acid sequence of 
SEQ ID NO.:2, or at least 95% identity to the amino acid sequence of SEQ ID NO.:2, or 
at least 98% identity to the amino acid sequence of SEQ ID NO.:2. 

13. The method according to claim 1, where the stress inducible promoter is comprised by 
the repeated octarneric motif of SEQ ID NO.: 3. 

14. The method according to claim 1, where the stress inducible promoter comprises the 
repeated octarneric motif of SEQ ID NO.: 3. 

15. The method according to claim 1, where the stress inducible promoter is identical to the 
octarneric motif of SEQ ID NO.: 3. 

16. The method according to claim 2, where the reporter protein is 2-fold over expressed in 
a secretion stressed cell compared to a non secretion stressed cell, preferably 5-fold 
over expressed in a secretion stressed cell compared to a non secretion stressed cell, 
more preferably 10-fold over expressed in a secretion stressed cell compared to a non 
secretion stressed cell, or 20-fold over expressed in a secretion stressed cell compared 
to a non secretion stressed cell, most preferably 50-fold over expressed in a secretion 
stressed cell compared to a non secretion stressed cell, or more than 100-fold over 
expressed in a secretion stressed cell compared to a non secretion stressed cell. 
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17. The method according to claim 2, where the reporter protein is selected from the group 
consisting of fluorescent protein, antibiotic markers, and substrate converting enzymes. 

18. The method according to claim 1, where the stress inducible promoter is comprised by 
5 nucleic a cids 1 -999 o f SEQ I D N O.: 1 , a nd the h ost c etl further comprises a n I PTG- 

inducible promoter operabiy linked to a nucleic acid sequence encoding the amino 
acids 1 to 449 of SEQ ID NO:2. 

19. The method according to claim 1, where the stress inducible promoter comprises 
10 nucleic acids 1-999 of SEQ ID NO.:1, and the host cell further comprises a IPTG- 

inducibte promoter operabiy linked to a nucleic acid sequence encoding the amino 
acids 1 to 449 of SEQ ID NO:2. 

20. The method according to claim 1, where the stress inducible promoter consists of 
15 nucleic acids 1-999 of SEQ ID NO.:1, and the host cell further comprises a IPTG- 

inducible promoter operabiy linked to a nucleic acid sequence encoding the the amino 
acids 1 to 449 of SEQ ID NO:2. 

21. The method according to claim 2. where the nucleic acid sequence in 2(H) encodes a 
20 polypeptide. 

22. The method according to claim 2, where the nucleic acid sequence in 2(H) encodes an 
enzyme. 

25 23. The method according to claim 22, where the enzyme is selected from the group 
consisting of proteases, cellulases (endoglucanases), beta-glucanases, hemicellulases, 
lipases, peroxidases, laccases, alfa-amylases, glucoamylases, cutinases, pectinases, 
reductases, oxidases, phenoloxidases, ligninases, pullulanases, pectate lyases, 
xyloglucanases, xylanases, pectin acetyl esterases, polygalacturonases, 

30 rhamnogalacturonases, pectin lyases, mannanases, pectin methylesterases, cello- 

biohydrolases, transglutaminases and phytases. 

24. The method a ccording t o c laim 2 1 , w here t he nucleic a cid s equence i s o btained b y 
mutating a nucleic acid sequence encoding a polypeptide. 

35 

25. The method according to claim 21, where the nucleic acid sequence is obtained by 
mutating a nucleic acid sequence encoding a protein engineered polypeptide. 
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ABSTRACT 



The invention describes a method of screening for protein secreting recombinant host ceils. 
The method can be used for rapid identification of actively secreting transformants and can be 
5 used to screen recombinant libraries for transformants secreting proteins. 
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<110> Novozymes A/S 

<120> Detection system for high expressing recombinant Bacillus 

<130> 10355 

<160> 11 

<170> Patentln version 3.2 

<210> 1 

<211> 2349 

<212> DNA 

<213> Bacillus subtilis 
<220> 

<221> YkdA_protein 

<222> C1000)..(2349) 

<223> 1-999 is promoter 

<220> 

<221> CDS 

<222> (1000) . . (2349) 

tctttcaagg attcatgttt gtttaccaac ctgttctgta agcagttcat attttctcag 
ggttctttca aatacctcat caaaaacgtc cggcacagag gcgtgtatca cctcagctcc 
ctctcccgtt attccgcctt tggttgcaac acgttccagc gtctcctcga aagacatatt 
tttttcgatc agcatcttgc cagttccgta tagcgaatga atcagaaaat caaaggcttc 
ttctttggac aggctgctgt ttctgacggc agacagtgcc agttcttcaa agattgcagc 
tatgaatccc ggtgccgagc tcgttaaatt gctggccaca tctaaattcg attctttgat 
ttcccgtaca cggctgaaaa ccgataacaa ttcattcaga cgttcttttt tctctgcagc 
cagtgcttca ctgtgaacga caagtgagat gccggcttct gcttcggacg taatggcagg 
aataacacgt gagataccgg cttctgtttc tgcctccaaa agacgcagcg gcacaccggc 
agctatggat acgatgtgag tatttctgtt cacatacggg tacagacggc gcattgtttc 
gatgacatgg agtggcggga cgcatatcaa aatcaattgg cacgtatttg cccaattctc 
caacggatca gccgatacgt ttggataatc tgacatgagt gcccgcagcc gctccccctt 
cgttctcgtc tcaataaata gctcattctc ttttatttgt tcatgtttca acagctgtct 
agcgatcata tccgccatgc tgccatatcc aatcaatcca atctgttcca tcgactcagt 
cctttcatat acaatatgaa gtgtaccgtt ttccgcactt tttcacaatt tcccataatc 
ttttcatttt tatcccacag tttttgttta tgataaactc aagtcataaa cctatcaata 
taaatagaca tgtgaaaata gagaaacgga gtgaacatg atg gat aac tat cgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1014 
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gag aac gat cag age gec tec tac teg gec cgc aat gtc att cat gat lliu 
Glu Asn Asp Gin ser Ala Ser Tyr ser Ala Arg Asn Val lie His Asp 
25 30 35 

cag gag aag aaa aaa cga gga ttc gga tgg ttc aga ccg ttg ctt ggc 1158 
Gin Glu Lys Lys Lys Arg GTy Phe G?y Trp Phe Arg Pro Leu Leu GTy 
40 45 50 

gga gtg ate ggc ggc agt ctt get ctt ggc att tac acg ttt aca ccg 1206 
Gly val lie Gly cly Ser Leu Ala Leu GTy lie Tyr Thr Phe Thr Pro 
55 60 65 

ctt ggt aac cat gat tct cag gac act gca aaa caa tea tec age cag 1254 
Leu Gly Asn His Asp Ser Gin Asp Thr Ala Lys Gin ser ser Ser Gin 
70 75 80 85 

cag caa acg caa tct gtt aca gca aca age acc tec tct gaa tct aaa 
Gin Gin Th? Gin Ser Val Thr Ala Thr ser Thr ser ser Glu ser Lys 
90 95 100 

aaa age tea age age tea tct gca ttc aag age gag gac tct tct aaa 
Lys ser Ser Ser ser ser ser Ala Phe Lys Ser Glu Asp ser ser Lys 
105 110 H5 

ate tea gat atg gta gaa gac ctt tea cca gcg att gtc ggt att aca 
He ser Asp Met val Glu Asp Leu ser Pro Ala He val Gly lie Thr 
120 125 130 

aat ctt cag gca caa tea aac age tct ttg ttc ggc tct agt tct tct 
Asn Leu Gin Ala Gin Ser Asn Ser Ser Leu Phe Gly ser ser ser ser 
135 140 145 

gat tec age gaa gat aca gaa age ggt tea gag tea ggt gtc att ttc 
Asp Ser Ser Glu Asp Thr Glu Ser Gly ser Gly ser Gly val lie Phe 
150 155 160 165 

aaa aaa gag aat ggc aag get tat ate att aca aat aac cac gtc gta 
Lys Lys Glu Asn Gly Lys Ala Tyr He He Thr Asn Asn His val val 
170 175 180 

gaa ggg gca tea tea ctg aag gta tct tta tat gac ggc act gag gtt 
Glu Gly Ala ser Ser Leu Lys val Ser Leu Tyr Asp Gly Thr Glu val 
185 190 195 

act gca aag ctg gta ggc agt gac teg tta act gat tta gee gtc etc 
Thr Ala Lys Leu val Gly ser Asp Ser Leu Thr Asp Leu Ala val Leu 
200 205 210 



1302 
1350 
1398 
1446 
1494 
1542 
1590 
1638 
1686 



caa ate agt gat gac cac gtc aca aaa gtg gca aac ttc ggt gat tea 
Gin He Ser Asp Asp His Val Thr Lys val Ala Asn Phe Gly Asp Ser 
215 220 225 

tct gat ctt aga aca ggc gag acc gtt att gcg att ggg gat ccg ctt 1734 
ser Asp Leu Arg Thr Gly Glu Thr val He Ala He Gly Asp Pro Leu 
230 235 240 245 

gga aaa gac ctg tec cgc aca gta aca caa gga att gta age ggc gtg 
Gly Lys Asp Leu ser Arg Thr val Thr Gin Gly lie val Ser Gly val 
. 250 255 260 

gac aga acg gtt tea atg tct aca tea gec ggc gaa acg age att aac 
asp Arg Thr val Ser Met ser Thr Ser Ala Gly Glu Thr Ser He Asn 
265 270 275 

gtc att cag aca gac gca gca att aat cca ggt aac age ggc ggt cct 
val He Gin Thr asp Ala Ala He Asn pro Gly Asn Ser Gly Gly Pro 
280 285 290 
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1830 
1878 



1926 
1974 
2022 
2070 
2118 
2166 
2214 
2262 
2310 
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ttg tta aat aca gac ggc aaa att gtc ggc att aac agt atg aaa ate 
Leu Leu Asn Thr Asp Giy Lys lie val Gly lie Asn ser Met Lys lie 
295 300 305 

agt gag gat gat gtt gag ggt ate gga ttc gec att cca age aat gac 
ser Glu Asp Asp val Glu Gly He Gly Phe Ala He Pro Ser Asn Asp 
310 315 320 325 

gta aaa ccg att get gaa gaa ttg ctg tct aaa gga caa att gaa cgt 
val Lys Pro lie Ala Glu Glu Leu Leu Ser Lys Gly Gin lie Glu Arg 
330 335 340 

cca tat ate ggt gtc age atg ctt gat eta gag caa gtg ccg caa aat 
pro Tyr lie Gly val Ser Met Leu Asp Leu Glu Gin val Pro Gin Asn 
345 350 355 

tac caa gaa ggc aca etc ggc ctg ttc ggc age cag ctg aat aaa ggc 
Tyr Gin Glu Gly Thr Leu Gly Leu Phe Gly ser Gin Leu Asn Lys Gly 
360 365 370 

gtt tac ate cgt gag gtc get tea ggc tct cct get gaa aag gee gga 
val Tyr lie Arg Glu val Ala ser Gly Ser Pro Ala Glu Lys Ala Gly 
375 380 385 

tta aaa gcg gag gat att ate ate ggc eta aaa ggt aaa gaa att gat 
Leu Lys Ala Glu Asp lie He lie Gly Leu Lys Gly Lys Glu lie Asp 
390 395 400 405 

aca ggc agt gaa ttg cgc aat ate tta tat aaa gac gca aag ate ggt 
Thr Gly ser Glu Leu Arg Asn lie Leu Tyr Lys Asp Ala Lys lie Gly 
410 415 420 

gat acc gtt gaa gtg aaa att etc cga aac ggc aaa gaa atg acg aaa 
Asp Thr val Glu val Lys He Leu Arg Asn Gly Lys Glu Met Thr Lys 
425 430 435 

aaa att aaa ctg gat caa aaa gaa gag aaa act teg taa 2349 
Lys He Lys Leu Asp Gin Lys Glu Glu Lys Thr ser 
440 445 

<210> 2 
<211> 449 
<212> PRT 

<213> Bacillus subtilis 
<400> 2 

Met Asp Asn Tyr Arg Asp Glu Asn Arg Thr Lys Gly Asn Glu Asn Glu 
15 10 15 

val Phe Leu Thr Lys Glu Asn Asp Gin ser Ala ser Tyr Ser Ala Arg 
20 25 30 

Asn val lie His Asp Gin Glu Lys Lys Lys Arg Gly Phe Gly Trp Phe 
35 40 45 

Arg Pro Leu Leu Gly Gly Val lie Gly Gly Ser Leu Ala Leu Gly lie 
50 55 60 

Tyr Thr Phe Thr Pro Leu Gly Asn His Asp ser Gin Asp Thr Ala Lys 
65 70 75 80 
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Gin ser Ser Ser Gin Gin Gin Thr Gin Ser val Thr Ala Thr Ser Thr 
85 90 95 

ser ser Glu ser Lys Lys Ser Ser ser ser ser ser Ala Phe Lys Ser 
100 105 110 

Glu asp ser ser Lys He Ser Asp Met val Glu Asp Leu ser Pro Ala 
115 120 125 

lie val Gly He Thr Asn Leu Gin Ala Gin Ser Asn Ser ser Leu Phe 
130 135 140 

Gly Ser Ser ser Ser Asp Ser Ser Glu Asp Thr Glu ser Gly ser Gly 
145 150 155 -">" 

ser Gly val lie Phe Lys Lys Glu Asn Gly Lys Ala Tyr He lie Thr 
165 l'O 3 

Asn Asn His val val Glu Gly Ala ser ser Leu Lys val ser Leu Tyr 
180 185 19° 

asp Gly Thr Glu val Thr Ala Lys Leu val Gly ser Asp Ser Leu Thr 
195 200 205 

Asp Leu Ala val Leu Gin He ser Asp Asp His val Thr Lys val Ala 
210 215 220 

Asn Phe Gly Asp Ser Ser Asp Leu Arg Thr Gly Glu Thr Val lie Ala 
225 230 235 240 

lie Gly Asp Pro Leu Gly Lys Asp Leu ser Arg Thr val Thr Gin Gly 
245 250 255 

He val Ser Gly Val Asp Arg Thr val ser Met ser Thr ser Ala Gly 
260 265 Z'U 

Glu Thr ser lie Asn val He Gin Thr Asp Ala Ala lie Asn Pro Gly 
275 280 285 

Asn ser Gly Gly Pro Leu Leu Asn Thr Asp Gly Lys lie val Gly He 
290 295 300 

Asn ser Met Lys lie Ser Glu Asp Asp val Glu Gly He Gly Phe Ala 
305 310 315 320 

■w 

He Pro ser Asn Asp val Lys Pro He Ala Glu Glu Leu Leu ser Lys 
325 330 335 

Gly Gin lie Glu Arg Pro Tyr He Gly val ser Met Leu Asp Leu Glu 
340 ~ 345 350 
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Gin val Pro Gin Asn Tyr Gin Glu Gly Thr Leu Gly Leu Phe Gly Ser 
355 360 303 

Gin Leu Asn Lys Gly val Tyr He Arg Glu val Ala ser Gly ser Pro 
370 375 

Ala Glu Lys Ala Gly Leu Lys Ala Glu Asp lie He He Gly Leu Lys 
385 3 90 395 

Gly Lys Glu lie Asp Thr Gly Ser Glu Leu Arg Asn He Leu T^r Lys 

Asp Ala Lys He Gly Asp Thr Val Glu Val Lys He Leu Arg Asn Gly 
420 

Lys Glu Met Thr Lys Lys He Lys Leu Asp Gin Lys Glu Glu Lys Thr 
* 435 440 

ser 



<210> 3 

<211> 8 

<212> DNA 

<213> Artificial 

<220> . 

<223> octamenc motif 



<220> 

<221> octamenc_motif 
<222> CD • • (8) 

<400> 3 
ttttcata 



<210> 4 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<220> 

<22l> primer_oth48 
<222> CD • • C44) 

<400> 4 

gttcatcgat cgcatcggct aatcagacca cttcgggtga aggc 

<210> 5 

<211> 50 

<212> DNA . 

<213> Artificial 

<220> 
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<220> 

<221> Pnmer_oth50 
<222> CD.-C50) 



ggaljcggltt gaacatgcga ttaaatatcc ttcgagacat tttcgatcgc 



<210> 6 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<221> Primer_260558 
<222> (1)-.(21) 

<400> 6 

gagtatcgcc agtaaggggc g 



<210> 7 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



gccttcaccc gaagtggtct gattagccga tgcgatcgat gaac 



<210> 8 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<220> 

<221> Primer_260559 
<222> (1)..C23) 

<400> 8 

gcagccctaa aatcgcataa age 



<210> 9 

<211> 50 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<220> 

<221> Primer_oth51 

page 6 



10355. ST25 

<222> (1)..(50) 

gcgatcgaaa atgtctcgaa ggatatttaa tcgcatgttc aatccgctcc 

<210> 10 

<211> 28 

<212> DNA , 

<213> Artificial 

<220> 

<223> primer 



<220> 

<221> Primer_YKDA6 
<222> (1) . • (28) 

<400> 10 

gcgaattcta aactcaagtc ataaacct 



<210> 11 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 



<220> 

<221> Primer_YKDAPl 
<222> (1D-.C26) 

<400> 11 

gcggatccga tgatgaatga cattgc 
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